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Abstract — At present, the scale of data in many cloud 
applications increases tremendously in accordance with the Big 
Data trend, thereby making it a challenge for commonly used 
software tools to capture, manage and process such large-scale 
data within a tolerable elapsed time. In big data applications, 
data privacy is one of the most concerned issues because 
processing large-scale privacy-sensitive data sets often requires 
computation power provided by public cloud services. As a result 
is challenge for existing anonymization approaches to achieve 
privacy preservation on privacy-sensitive large-scale data sets 
due to their insufficiency of scalability. In this paper we propose 
a scalable Advanced Bottom up generalization approach for data 
anonymization based on Map Reduce on cloud. To make full use 
of the parallel capability of Map Reduce on cloud, specializations 
are required in an anonymization process. Original datasets are 
split up into a group of smaller datasets, and these datasets are 
anonymized in parallel, producing intermediate results. Then, the 
intermediate results are merged into one, and further 
anonymized to achieve consistent k-anonymous data sets. A 
group of MapReduce jobs are deliberately designed and 
coordinated to perform specializations on data sets 
collaboratively. 

Keywords- cloud; Data Anonymization; MapReduce; Bottom Up 
Generalization. 

I. Introduction 

Big data is a popular term used to describe the 
exponential growth and availability of data, both structured 
and unstructured. Big data is the term of datasets so large and 
complex that it becomes difficult to process using hand 
database management tools or traditional data processing 
applications. Big Data processing is performed through a 
programming paradigm known as MapReduce. Typically, 
implementation of the MapReduce paradigm requires 
networked attached storage and parallel processing. The fact is 
that with so much data being generated by so many 
organizations and users, storage and security simply have to 
become critical business issues. Ninety per cent of the total 
data in the world today has been created in the past two years, 
and 2014 and beyond will see us generating exponentially 
larger levels of data. So with more data comes greater threat of 
attack and greater need for security. 



Cloud computing is a model for enabling convenient, 
on-demand network access to a shared pool of configurable 
computing resources (e.g., networks, servers, storage, 
applications, and services) that can be rapidly provisioned and 
released with minimal management effort or service provider 
interaction. Cloud Computing is not a very new concept in IT, 
in fact Cloud Computing is a more advanced version of the 
Data Processing Service Bureaus that we had 40 years ago. 
The biggest and best known Cloud Computing providers 
include Amazon with EC2, Microsoft with Azure and Google 
with Google Apps (e.g. Gmail, Google Docs, and Google 
Calendar). However, as the shape of the cloud computing is 
emerging and developing rapidly both conceptually and in 
reality, the legal/contractual, economic, service quality, 
interoperability, security and privacy issues still pose 
significant challenge. A task of the utmost importance is to 
develop a secure way for data in a hostile environment so that 
the published data remain practically useful while individual 
privacy is preserved. Cloud computing offers the promise of 
big data implementation to small and medium sized 
businesses. 

Privacy is one of the most concerned issues in cloud 
computing, and the concern aggravates in the context of cloud 
computing although some privacy issues. An on-line cloud 
health service, aggregates data from users and shares the data 
with research institutes. Data privacy can be revealed with less 
effort by malicious cloud users or providers because of the 
failures of some traditional privacy protection measures on 
cloud. This can bring considerable economic loss or severe 
social reputation impairment to data owners. Hence, data 
privacy issues need to be addressed urgently before data sets 
are analyzed or shared on cloud. 

We creatively apply Map Reduce on cloud to BUG 
for data anonymization and deliberately design a group of 
innovative Map Reduce jobs to concretely accomplish the 
generalizations in a highly scalable way. Secondly, introduce a 
scalable Advanced BUG approach, which performs 
generalization on different partitioned data set and the 
resulting intermediate anonymizations are merged to find final 
anonymization which is used to anoymize the original data set. 
Results show that our approach can significantly improve the 
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scalability and efficiency of BUG for data anonymization over 
existing approaches. 
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Figure 1 : The Cloud System model 

II. Related Work and Problem Ananlysis 
A. Problem Ananlysis 

A wide verity of privacy models and anonymization 
approaches have been put forth to preserve the privacy 
sensitive information in data sets. Data privacy is one of the 
most concerned issues because processing large-scale privacy- 
sensitive data sets often requires computation power provided 
by public cloud services for big data applications. We studied 
the scalability issues of existing BUG approaches when 
handling big data-data sets on cloud. Most exiting algorithms 
exploit indexing data structure to assist the process of 
anonymization, specifically TEA (Taxonomy Encoded 
Anonymity) index for BUG. TEA is a tree of m levels. The i th 
level represents the current value for Dj. Each root to-leaf path 
represents a qid value in the current data table, with a (qid) 
stored at the leaf node. In addition, the TEA index links up the 
qids according to the generalizations that generalize them. The 
data structure TEA proposed in [239] can handle only a single 
QID. A new structure is required if the data holder wants to 
achieve LKC privacy using the Bottom-Up Generalization 
method because LKC privacy in effect is equivalent to 
breaking a single QID into multiple QIDs. 




Figure 2: The TEA Structure for QID 
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B. Related Work 

Although indexing data structures can speed up the 
process of data anonymization and the generalization process, 
because indexing structure avoids frequently scanning entire 
data sets and storing statistical results circumvents re- 
computation overheads, these approaches often fail to work in 
parallel or distributed environments like cloud systems 
because the indexing structures are centralized. There is an 
assumption that all data processed should fit in memory for the 
centralized approaches. Unfortunately, this assumption often 
fails to hold in most data-intensive cloud applications 
nowadays. Thus concluding that, the centralized approaches 
are difficult in handling large-scale data sets well on cloud 
using just one single VM even if the VM has the highest 
computation and storage capability. As in bottom up search 
strategy for finding optimization works well when the value of 
k is small. Centralized BUG lacks in high performance for 
certain value of k-anonymity parameter if they are used 
individually. In BUG, Calculating the ILPG and generalizing 
the data set involve accessing a large number of data records, 
thereby dominating the scalability and efficiency of Bottom- 
up Generalization. When generalize the information and 
privacy requirements to the problems of centralized 
anonymization and distributed anonymization, and identify the 
major challenges that make traditional data anonymization 
methods not applicable and suffer from scalable problem. 
MapReduce has been widely adopted in various data 
processing applications to push upward the scalability and 
efficiency. The MapReduce framework is scalable and fault- 
tolerant because each node in the cluster is expected to report 
back periodically with completed work and status updates. If a 
node remains silent for longer than the expected interval, a 
master node makes note and re-assigns the work to other 
nodes. As to BUG, the existing approach make full use of 
indexing data structure to promote efficiency, thereby falling 
short of high scalability and parallelization in cloud 
environments. 

Thus, it is valuable in investigating how to develop 
BUG algorithm with MapReduce in order to improve the 
scalability and efficiency. We also attach MapReduce to 
improve scalability and efficiency in our research on big data 
anonymization. A scalable advanced Bottom-Up 
Generalization (BUG) approach for data anonymization based 
on Map Reduce on cloud will make full use of the parallel 
capability of Map Reduce on cloud, specializations required in 
an anonymization process and the scalability and efficiency of 
centralized BUG are improved significantly over existing 
approaches. 

III. Methodology 

Bottom-Up Generalization is an efficient k-anonymization 
method. In a k-anonymous data set, each record is 
indistinguishable from at least k-1 other records with respect 
to QID. Basically, Bottom-Up Generalization (BUG) approach 
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of anonymization is an iterative process starting from the 
lowest anonymization level. We leverage the 
information/privacy trade-off as the search metric for our 
approach, i.e., the Information Loss per Privacy Gain (ILPG). 
In this section we elaborate the Advanced BUG and 
MapReduce Bottom up Generalization (MRBUG) Driver. The 
Advanced BUG consists of following steps, data partition, run 
the MRBUG Driver on partitioned data set, combining the 
anonymization levels of the partitioned data set and applying 
generalization to original data set with integrated 
anonymization level without violating the k-anonymity. 

A. Outline of Advanced Bottom Up Generalization 

We propose an Advanced Bottom-Up Generalization 
(BUG) approach to improve the scalability and performance of 
BUG in advance fashion. The function of our approach is 
based on the two levels of parallelization provisioned by 
MapReduce on cloud. Basically, MapReduce on cloud has two 
levels of parallelization, i.e., job level and task level. Job level 
parallelization means that multiple MapReduce jobs can be 
executed simultaneously to make full use of cloud 
infrastructure resources. Task level parallelization refers to 
that multiple mapper/reducer tasks in a MapReduce job are 
executed simultaneously over data partitions. In advanced 
BUG, the dataset spit up is done by parallelized multi job 
mapreduce, and then the data set is anonymized by 
MapReduce Bottom up Generalization (MRBUG) Driver 
without violating the k-anonymity. Then all the intermediate 
anonymization levels are integrated, ensures that the integrated 
intermediate anonymized level never violates K-anonymity, 
the more general one is selected as the integrated one. After 
obtaining the merged intermediate anonymization level, we 
execute the driver on the entire original data set, and get the 
resulting anonymization level. Then, the data set is 
anonymized by replacing original attribute values in it with the 
responding domain values obtained by the resulting 
anonymization level. The procedure for Advanced Bottom Up 
Generalization is as follows. 

Input: Dataset DS, Anonymity k, k , partition p, record r, r 
DS, Anonymization Level AL 0 

Procedure: 
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4. Merge all AL l ',- where the resulting AL 2 should general 

or identical. 

5. Run MRBUG Driver with DS, k, AL 2 as parameters, 
results AL* 

6. For each attribute value v, in r, find gen in AL* 

7. Generate r*=( q h q 2 , q m ), q is quasi-identifier, m 
is number of attributes 

8. If gen is INACTIVE then emit (r* , count) 

9. For each r sum — > £ count 

10. Emit (r*,sum) 



B. Data Split Ups 

Here the original data set DS is partitioned into smaller 
ones. Let DS t , where l<i<p, denote the data sets partitioned 
from DS, where p is the number of partitions. Specifically, a 
random number ran, where 1< ran < p, is generated for each 
data record. A record is assigned to the partition DS mn . 

Anonymization is not invulnerable counter measures 
that compromise current anonymization techniques can expose 
protected information in released datasets. After getting the 
partitioned data set DS„ we run MRBUG (DS, , k 1 , AL") on 
these data sets in parallel to derive intermediate anonymization 
levels AL'j where 1< i<p. 

C. Integrating the Partitioned Data 

Here the intermediate anonymization levels are merged 
into one (AL 2 ). The merging of anonymization levels is 
completed to ensure that the merged intermediate 
anonymization level {AL 2 ) never violates privacy 
requirements, the more general and identical one is selected as 
the merged one. If the intermediate anonymization levels AL 1 ',■ 
satisfies privacy requirements, then the merged anonymization 
level AL 2 will not violate the privacy requirements. Then, 
MRBUG can further anonymize the entire data sets to produce 
final ^-anonymous data sets. 



1. Scan DS and Generate a Random number ran, 

Emit (ran, r) 

2. For each ran 

Emit (null, list(r)) 

3. Run MRBUG Driver with DS h k 1 , AL" as parameters 
where l<i<p, results AL 1 1 



D. MRBUG Driver 

MRBUG plays a main role in the Advanced BUG approach, 
as it is invoked by two times to concretely process 
generalization. Basically, a practical MapReduce program 
consists of Map and Reduce functions, and a Driver that 
coordinates the macro execution of jobs. Each round of BUG 
iteration includes four major steps, namely, checking the 
current data set whether satisfies the anonymity requirement, 
calculating the ILPG, finding the best generalization and 
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generalizing the data set according to the selected best 
generalization. An existing approach uses indexing data 
structure but MapReduce does not support indexing data 
structure. So for calculating ILPG we use mapreduce jobs. The 
procedure for MRBUG driver is as follows. 

Input: Data Set DS, Anonymity k, Data record (ID n r), r 
DS, Anonymization Level AL 0 

Procedure: 

1. Scan DS and initialize search metric ILPG 

2. For each(if Anonymity <k before generalization) 
Find gen test and set it INACTIVE 

3. if set of gen and its siblings are INACTIVE then 
Insert gen „ m , to replace INACTIVE ones 

4. AL i+1 <— ALj ; Update search metric ILPG for all 
active gen 

5. Repeat the iteration 

The MRBUG Driver starts by initiating the search metric 
values ILPG, next it checks for current anonymity satisfies the 
privacy requirements, the driver finds the best generalization 
of Highest ILPG and set genb est as INACTIVE that means 
genb es t is not consider for next iteration. If set of all gen and its 
siblings are labeled as INACTIVE then insert new higher level 
generalization to replace all the INACTIVEs. At last reset new 
value for anonymization level and update the search metric 
ILPG. For initiating and updating the search metric ILPG 
involves accessing to the original data set and computing 
statistic information over the data set. The process of 
generalization also involves accessing to the original data set. 
The information loss of a generalization will not be affected 
when we perform other generalizations or insert a new 
generalization, while privacy gain will probably be impacted 
as the anonymity of the data set will change. The essential of 
computing anonymity of a data set is to find out the minimum 
Ql-group size. The ILPG calculation results in Information 
gain, anonymity for generalizations and intermediate key 
value pair (key, count). Map and Reduce, defined over a data 
structure named key-value pair {key, value). Map takes a pair 
(M, vl) as input and then outputs another intermediate key- 
value pair (k2, v2). These intermediate pairs are consumed by 
the Reduce function as input. Reduce takes intermediate k2 
and all its corresponding values list (v2) as input and outputs 
another pair (k3, v3). 

E. Data Generalization 

The original data set is concretely generalized for data 
anonymization by a MapReduce job. The Map function emits 
anonymous records and its count according to the current 
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anonymization level. The Reduce function simply aggregates 
these anonymous records and counts their number. An 
anonymous record and its count represent a Ql-group, and the 
Ql-groups constitute the final anonymous data sets. 



DatasetDS 



Dataset Split ups 



Run MRBUG Driver 



Merge Anonymization levels 



Run MRBUG Driver 



Apply generalization 



Figure 3: The Flow Diagram 

IV. Sample Evaluvation 

To evaluate the effectiveness and efficiency of the 
Advanced BUG approach, we compare Advanced BUG with 
centralized BUG. We denote the execution time of the two 
approaches as Adv-BUG and cen-BUG, respectively. Our 
experiments are conducted in a cloud environment named 
Amazon Elastic MapReduce. Amazon Elastic 
MapReduce (EMR) is a web service that uses Hadoop, an 
open-source framework, to quickly & cost-effectively process 
vast amounts of data. Elastic MapReduce is a web service 
built on top of the Amazon cloud platform. All approaches are 
implemented in Java and standard Hadoop MapReduce API. 
We utilize the Health Care Sample data set from US 
Government Open Data Projects (Dataset). 

We measure the change of execution time Cen-BUG 
and Adv-BUG with respect to data size. We can see from 
Fig. 4 that the execution time of the Advanced approach is 
kept under a certain level, while centralized BUG incur high 
execution time when the data increases in size. Hence, Cent- 
BUG suffers from scalability problem for large-scale data sets. 
The above experimental results demonstrate that our approach 



4 



http://sites.google.com/site/ijcsis/ 
ISSN 1947-5500 



can significantly improve the scalability and efficiency 
compared with Centralized approaches. 



6000 
4000 
2000 - 
0 



■Avd-BUG 
•Cen-BUG 



0 1000 2000 3000 



X-axis: data size in MB; Y-axis: time in seconds 
Figure 3: Change of execution time with respect to data size 

V. CONCLUTION 

In this paper, we studied the scalability problem of data 
anonymization for big data applications on cloud using 
Bottom Up Generalization (BUG) and proposed scalable 
Advanced Bottom Up Generalization. The proposed BUG 
performed as Data partitioned, executing the driver producing 
intermediate results. Then, the intermediate results are merged 
and generalization is applied to produce anonymized data 
without violating ^-anonymity. The MapReduce Framework is 
effectively applied on cloud for data anonymization and shows 
that scalability and efficiency of centralized BUG are 
improved significantly over existing approaches. In future 
optimized balanced scheduling strategies are expected to be 
developed towards overall scalable privacy preservation aware 
dataset scheduling. And also our method is designed for 
achieving k-anonymity; it can be modified to adopt the LKC- 
privacy model in order to accommodate the high-dimensional 
data. 
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Abstract — A command control system refers to a general 
control system, of a production system, of a process, or of another 
type of dynamic system, wherein the control elements are not 
centralized, but are distributed throughout the system; and each 
component or sub- system is controlled by one or more controllers. 

DCS (Distributed Control System) is a computerized control 
system used to control the production line in the industry 

The entire controller system is connected via networks 
communication and monitoring. DCS is a very broad term used in 
various industries, to monitor and control distributed equipment. 

Nowadays, there are generally an information system upstream 
command control systems. To ensure the security of a system, we 
must secure the weakest link of this system. Thus, a flaw in the 
information system can lead to a multitude of command control 
system vulnerabilities. 

This is considered as a real threat to the states [1] and can lead 
to disasters [2]. A multitude of vulnerabilities currently affects 
these systems. These vulnerabilities are exploited for various 
reasons such as cybercrime, industrial espionage or other. 

The objective of our work is to analyze the workings of Stuxnet 
and to propose countermeasures to strengthen the security of 
command control systems. 

Key Words— Stuxnet, Strong authentication, SCADA, HOTP. 

I. INTRODUCTION 

Since its discovery in June 2010, the Stuxnet virus has made a 
lot of ink flow; not only for the sophisticated manner in which 
it is designed, but also because of the nature of the systems 
attacked, considered as critical, i.e., systems Supervisory 
Control And Data Acquisition (SCADA), which are essential in 
the industrial world. 



That's why; a large number of studies on the functioning of 
Stuxnet were carried out. They give more details on the 
vulnerabilities exploited, in order to infiltrate and to spread in 
systems. 

To better understand the attack, we will analyse the functioning 
of Stuxnet, which will allow us to identify gaps and different 
access systems; and thereafter to propose a countermeasure able 
to prevent this kind of attack. 

In our paper, we used Boolean logic modeling Driven Markov 
Process (BDMP) Stuxnet which was conducted by researchers 
[3] to quantify and calculate the probabilities of successful 
attacks. This allow us to better identify risk areas for 
suggesting the best countermeasures 

This model allow us to (i) have a disambiguation, (ii) have a 
better readability and conciseness, (iii) study the behavior of the 
system before and after implementation of countermeasures, 
and evaluate the added value of our solution. 

Our paper is organized as follows: In Section II we present the 
state of the art on Stuxnet. In Section III we present the 
functioning of Stuxnet. Section IV is dedicated to proposed 
countermeasures. Section V, present a qualitative study of our 
proposal, and we finish with a conclusion and perspectives. 



II. STATE OF THE ART 

In this chapter, we describe the Stuxnet virus then we present 
the BDMP formalism used to modeling the Stuxnet. After, we 
introduce the systems attacked by this virus, and we finish by 
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presenting the methods used to protect systems with strong 
authentication using the HMAC ONE TIME PASSWORD 
established by the Open Authentication consortium. 

A. Boolean logic Driven Markov Process (BDMP) 

BDMP is a graphical modeling formalism originally 
designed for safety and reliability assessment of systems. This 
formalism is an extension of fault trees used to represent Markov 
graphs of large size . It therefore provides a good readability, a 
good hierarchical representation and an advanced quantification 
capabilities. The formalism BDMP offers, features dynamic 
modeling with a particular type of link called "triggers." 

Fig. 1 shows an example of modeling BDMP 



Successful attack 





Attacker's objective 




Step I 



Step 2 



Step I - Alternative 1 Step I - Alternative 2 

Figure 1: Modeling BDMP 

B. Stuxnet virus: 

Stuxnet is a virus developed to attack information systems 
using SCAD A networks exploiting. This virus has infected 
thousands of machines, based on a new technique of attack. This 
is achieved by the use of trusted certificates, allowing it, to be 
executed in legitimacy in windows OS. The functioning of the 
virus is presented bellow. 

After his appearance, many countermeasures were made, 
like the proposal done by Siemens WinCC SCADA System 
editor, who offer a tool to remove Stuxnet. Microsoft have also, 
corrected the vulnerabilities exploited by the virus (for 
possessing the administrator's rights) [9]. But the exploitation of 
any zero-day vulnerabilities may lead to a same result. If one of 
the exploited Windows vulnerabilities is not corrected, it will 
allow modifying the executed processes, and this is considered 
as a real danger. 

C. Supervisory Control And Data Acquisition SCADA: 

The networks SCADA is a set of computers and applications 
that perform key functions in the delivery of services and 
commodities (electricity, natural gas, etc) and offers remote 
management on a large scale distributed at measures and 
controls [5]. 



D. Open Authentication OATH: 

It is a consortium in which there are large companies driven 
by the strong growth which aims to improve safety in a world 
increasingly connected. It provides solutions that enable strong 
authentication for all users on all devices and networks [6]. 

E. Hashed Message Authentication Code One Time Password 
HOTP: 

HMAC OTP (HOTP) is an algorithm to generate the one- 
time-password value based on Hashed Message Authentication 
Code (HMAC) [7]. This algorithm is used in a wide variety of 
networks and systems application. We give an example of the 
implementation of this algorithm in chapter IV. 

F . Dynamic Link Library DLL: 

DLLs are executable files that allow applications to share 
code to perform one or more predefined functions. One of the 
biggest advantages is that a DLL is not loaded into memory, 
which saves memory while allowing several programs to 
function effectively. However, there are malicious DLL. These 
programs can change the parameters of the systems and allow to 
intruders to remotely access to these latter. 

III. FUNCTIONING OF STUXNET 

Generally, the machines are infected by execution of 
autorun.inf or by file sharing. The machines are infected by 
loading a DLL file. This file contain a multiple functions which 
are exploited later by Stuxnet, subsequently, the virus checks the 
machine's configuration and its environment, in order to choose 
the continuation or discontinuation of scripts based on the 
machine configuration. 

If the configuration of the system is correct and updated; 
Stuxnet verifies if it possesses the administrator rights, if not, it 
exploits one of the two zero-day vulnerabilities in order to get 
the required rights [9]: (i) CVE- 2010-2743 (MS -10- 073) Or 
(ii) vulnerability in the Windows Task Scheduler. 

Both vulnerabilities allow any program to have privileges, 
and also, to be run in a new process or as a new task in the task 
manager. The environment is now ready to be infected by 
Stuxnet. The infection process is done according to the antivirus 
installed. 

Stuxnet installs the device drivers in the registry, to ensure 
their performances every time the machine started. They are 
loaded before all Windows applications. 

Then it changes the settings of the Windows Firewall (Windows 
Defender) to avoid being blocked later [10]. The Fig. 2 shows the 
different steps of that operation. 
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Figure 3: Hook DLL Functioning 



Figure 2 : STUXNET functioning Diagram 

Stuxnet uses a special method to load the DLL; normally the 
function LoadLibrary loads the DLL from the ROM and not 
from RAM. But Stuxnet is able to load the DLL from RAM. 
This operation is possible thanks to a special method explained 
below. 



Stuxnet is based on WriteProcessMemory or 
CreateRemoteThread [12] functions that write data in an area of 
memory in a specified process. 

The Fig.4 illustrates the method used by Stuxnet to load the 
DLL: 



Stuxnet is installed by loading in a process the Stuxnet 
modules from the explorer.exe 



He overwrites the old functions that contains the process and 
writes its own functions, after that, it patch APIs processes 
modifying their names, and then writes the parameters of the 
new features in the MZ header. 

These operations make believe that the DLL is loaded from 
the ROM. 
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is the exploitation of WriteProcessMemory or 
CreateRemoteThread; that allow to Stuxnet to write in a process 
running, and change the APIs. 

In this paper we propose the use of the OTP whenever we 
want to change any process executed by the system. In this case, 
an OTP is requested, and only after the authentication, process 
may perform changes. Otherwise, the operation fails. 

The diagram in Fig. 5 shows at what level we can introduce 
the countermeasure. 
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Figure 4: Stuxnet Loading DLL 

One of the weaknesses of the system is the lack of 
verification of the entity identity which aims to change the 
process, thats why we propose to increase the level of security 
using authentication at this point. 

IV. COUNTERMEASURES 

We propose to use a multi-factor authentication as solution. 
This multi- factor authentication makes the systems more secure 
in case a virus exploits vulnerabilities in order to modify 
functions in the process already executed. The virus would not 
be able to use all the features of the machine if it does not have 
the OTP generated by the token: The use of machine resources 
requires a OTP for access to resources. This solution aims to 
secure: (i) Access to resources, (ii) Writing in files, (iii) 
Execution of applications (iv) Changes in the DLL. 

In Fig. 3 and Fig.4, we explained how Stuxnet is installed on 
machines, and one of the main steps that allows this installation 



f Call the desired (unctions J 



Call:hs -res .MS') * 3 lf.iitttor 
itssse he litraiy-CcdL as': 



D 



Figure 5: Loading DLL with OTP 

The generation process of the HOTP is based on the 

algorithm defined in the RFC 4226. We have studied the 

behavior of Stuxnet for knowing at what level the 
implementation of this solution is the most effective. 
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We have therefore proposed a modification of both 
WriteProcessMemory function and CreateRemoteThread 
function of windows systems. 

The syntax of WriteProcessMemory function that writes 
data to a memory area in a specified process coded in C + + is: 

BOOL WINAPI WriteProcessMemory ( 

_In_ HANDLE hProcess, 

_In_ LPVOID lpBaseAddress, 

_In_ LPCVOID lpBuffer, 

_In_ SIZE 
T nSize, 

_Out_ SIZE_T *lpNumberOfBytesWritten 

) ; 

We proceed to modify the function by adding an OTP 
authentication: 

BOOL WINAPI WriteProcessMemory ( 



In 


HANDLE hProcess, 


In 


LPVOID lpBaseAddress, 


In 


LPCVOID lpBuffer, 


In 


SIZE T nSize, 


In 


LPSTR hOtp, 


Out_ 


SIZE T *lpNumberOfBytesWritten 



) ; 

The same changes are applied on the CreateRemoteThread 
function: 

HANDLE WINAPI CreateRemoteThread ( 

_In_ HANDLE hProcess, 

_In_ LPS ECURITY_ATTRI BUTE S 
lpThreadAttributes , 

_In_ SIZE_T dwStackSize, 

_In_ LPTHREAD_START_ROUTINE 
IpS tart Address , 

_In_ LPVOID lpParameter, 

_In_ DWORD dwCreationFlags, 

_Out_ LPDWORD lpThreadld 

) ; 

Changed to: 

HANDLE WINAPI CreateRemoteThread ( 

_In_ HANDLE hProcess, 

_In_ LPS ECURITY_ATTRI BUTE S 
lpThreadAttributes , 

_In_ SIZE_T dwStackSize, 

_In_ LPTHREAD_START_ROUTINE 
IpStartAddress , 



_In_ LPVOID lpParameter, 
_In_ DWORD dwCreationFlags, 
_In_ LPSTR hOtp, 
_Out_ LPDWORD lpThreadld 

) ; 

In the event that the OTP is not correct, an error is triggered 
and stops the further modification process. 

As explained in RFC 4226 [7], the HOTP is based on a 
incrementing counter (C) and a secret key (K) shared between 
the machine and the device (token). An example of the 
implementation of HOTP with a python code is given in Fig.. 



V. QUALITATIVE STUDY OF THE ESTABLISHMENT OF 

A OTP 

We have seen in Fig. 5 that the use of various system 
vulnerabilities allows to Stuxnet (or other type of worm 
exploiting the same vulnerabilities) to damage a system, 
especially in the context of a targeted attack. We proposed a 
solution based on RFC 4226. In this chapter we are going to 
evaluate the contribution of the proposed countermeasure, for 
know what our proposal brings to systems. Also, to know at 
what kind of security requirements it meets. 

Our choice was made on the evaluation made by the method 
of Common Criteria [13]. 

Consider the case where the entities that contain SCADA 
systems are the Target Of Evaluation TOE. The evaluation we 
do concern the requirements of security, such as: availability, 
confidentiality and integrity. In our case the proposed 
countermeasure that is OTP (RFC 4226) for secure against 
modification in the started process, increases availability 



# ! /usr/bin/env python 

OATH HOTP + TOTP Implementation in python. 

Based on http : / /tools . ietf . org/html/r f c422 6 

Parameter and function names kept inline with the 
RFC 

(e.g. HOTP, Truncate, K, C etc) 

import hmac 
import hashlib 

def H0TP(K, C, digits=6) : 

C_bytes = _long_to_byte_array (C) 
hmac_shal = hmac . new (key=K,msg=C_bytes, 

digestmod=hashlib . shal ) . hexdigest ( ) 

return Truncate (hmac_shal ) [-digits:] 
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because the right resource can be accessed only by the 
appropriate entity. 

Fig. 6 illustrates these high level concepts and relationships 
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Figure 6: Security concepts and relationships 

Our approach allow to have a protection profile different 
from the profile without OTP, which not only improves the level 
of system availability, but also the integrity, because the 
information cannot be changed only by entities entitled [14]. 

The Fig.7 show the method used to evaluate the 
countermeasures. 
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Figure 8: Comparison of protection profile with and without OTP 

The three criteria are different, and their improvement is 
ensured by different methods but the improving of one of the 
criteria has an impacts on the improvement of the other criteria 
systematically. [14]. 

VI. Conclusion 

Viruses, as Stuxnet continue to spread, exploiting technical 
and humans vulnerabilities. In this paper we treated 
countermeasures that block any viruses based on a loading of the 
DLL. In future work we are going to make a quantitative study 
of the proposed solution. Then, we aim prepare a set of tools to 
detect and prevent against this type of attack 
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Figure 7: Evaluation concepts and relationships 

The table below shows the impact of the establishment of a 
security solution with OTP on the various parameters of the 
requirements of security: confidentiality, integrity and 
availability. 
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Abstract-- Service oriented architecture integrated 
with text mining allows services to extract 
information in a well defined manner. In this paper, 
it is proposed to design a knowledge extracting 
system for the Ocean Information Data System. 
Deployed ARGO floating sensors of INCOIS 
(Indian National Council for Ocean Information 
Systems) organization reflects the characteristics of 
ocean. This is forwarded to the OIDS (Ocean 
Information Data System). For the data received 
from OIDS, pre-processing techniques are applied. 
Pre-processing involves the header retrieval and 
data separation. Header information is used to 
identify the region of sensor, whereas data is used 
in the analysis process of Ocean Information 
System. Analyzed data is segmented based on the 
region, by the header value. Mining technique and 
composition principle is applied on the segments 
for further analysis. 

Index Terms— Service oriented architecture; Text 
Mining; ARGO floating sensor; INCOIS; OIDS; 
Pre-processing. 

I. INTRODUCTION 

Text mining is described as the process of 
deriving high quality information from text. High 
quality information is derived from the patterns and 
trends. Text mining involves the processes of 
structuring the input text, deriving patterns from the 
structured data and finally the interpretation and 
evaluation of output. 

View of text mining is an extension of 
data mining or knowledge from structured 
databases. Knowledge discovery from textual 
database refers generally to the process of 
extracting interesting or non-retrieval patterns or 
knowledge from unstructured text documents. 

In today's competitive market, information 
is one of the main managerial assets since its 
analysis helps in effective steering. The concept of 
text mining is used in E-learning Web Miner 
Application [1]. It is a graphical user interface built 
with several operational modes for its users. For 



each mode of operation separate template was 
designed using Java. 

Data mining technique in attrition analysis 
[2] is to identify a group of customers who have a 
high probability to attrite, and then the company 
can conduct marketing campaigns to change the 
behaviour in the desired direction. Nowadays the 
majority of large companies and corporations have 
to a greater or lesser extent a Data Warehouse and 
they use OLAP tools to extract and analyze the 
information which allows them to stand themselves 
in the market. 

However, although there are areas where 
data mining techniques are being used more and 
more, such as business [3], marketing, education, 
banking, health and security systems, and so on , 
their use is still not generalized. This is mainly due 
to the fact that data mining projects need highly 
qualified professionals (expert data miners) to 
achieve, in reasonable time, useful results for 
business. One of the reasons for which expert data 
miners are required is that the knowledge discovery 
in databases (KDD) process involves multiple 
stages, and regretfully, in each one, there are a 
large number of decisions that have to be taken. 

Data mining have induced interest [4] 
among the business community, particularly in 
large corporations with strong collection of data 
about their customers and business operations. 
Increased concentration towards business 
applications has necessitated even more 
requirements for knowledge discovery projects. 

Extraction of information from the 
unstructured text [5], which will help all kinds of 
users. Information extraction typically is performed 
in the form of analysis pipelines. The pipeline 
stages are formant conversion, sentence splitting, 
tokenization, word stemming and annotation of 
tokens. 

Mining technique in decision support 
system [6] is described with the temporal data. In 
this relationships between the events that affect the 
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decision are discussed. This relationship is defined 
using the unsupervised learning technique of data 
mining. It not just defines the relationships between 
events, also extracts the interesting patterns and 
boundaries present in the system. The two methods 
of mining technique are supervised learning and 
unsupervised learning technique. Supervised 
learning technique is used in predictive statistical 
techniques whereas unsupervised learning method 
does not use dependent variables. It searches for 
the patterns and events. 

This paper is organized in the following 
sections. Section 2 involves the system description 
and section 3 describes proposed work of the 
system. Section 4 gives the details of simulation 
results and finally Section 5 projects the 
conclusion. 

n. SYSTEM DESCRIPTION 

A system modelled to service all 
community is introduced in this paper. The 
concepts used to achieve this system are text 
mining and service oriented architecture. Services 
provided by the proposed system are fishing zone 
advisory and tsunami alerts. This can be performed 
by the analysis of ocean information. Ocean 
information is retrieved from the OIDS of INCOIS 
organisation. Data from the OIDS is pre-processed 
with some constraints as adding precision points to 
the processed data. Pre-processing is literally to 
minimize the difficulties in analysis process. 
Measurement of index value and pattern extraction 
becomes easier with the pre-processed data. From 
the measured values a graphical representation is 
obtained, which clearly states the system's 
performance. 

IH. PROPOSED WORK 

Processing steps discussed in this paper 
are as follows: Pre-processing, Analysis, 
Exploration and Interpretation. Pre-processing step 
is generally defined as the data preparation process 
In this paper pre-processing step is for converting 
the hexa-decimal data to decimal data and adding 
precision values to the data. 

Analysis process of this system involves 
the calculation of ocean index value. Index value is 
to represent the nature and mining technique is to 
extract interesting patterns of the system. Formula 
used to calculate the index value is, 

N = 1.3247 - 2.5 X KTT 2 + S(2 X 1(T 4 - 8 X 

10 -7 r) + H?£_H^ (i) 

p2 pt V I 

The parameters used in the index 
calculation are described as N is the oscillation 
index, T is the ocean temperature, S is the salinity 



of ocean water and P is the pressure value. 
Application of mining technique is known as the 
extraction process. As we are processing the ocean 
data, unsupervised learning data technique is 
applied in the here. 

After the calculation of ocean index, 
association rule mining algorithm (an example of 
unsupervised learning method) is applied. It works 
by calculating the confidence and support metrics 
of the given data. Before this calculation, given 
data set is segmented by the time lag into events. 
The support of the rule is the number of times the 
rule holds same event in the database whereas 
confidence rate of an episode is calculated as, 

r =x [win a] => \|/ [win c] (2) 

The parameters win a, win c denotes the 
window values, \|/, % are time delay values which 
are estimated as the time difference between the 
similar patterns. The interpretation technique is 
used to deploy the extracted results. Here 
composition technique is used as an interpretation. 
Composition is the principle of service orientation 
and is used to compose the results of different 
operating systems. The analysis results of the ocean 
data are composed using this principle. The 
composition process gives us the details of fishing 
zone and tsunami alerts respective to the areas. 
This result is documented in a table format with the 
necessary fields, denoting the analysis process. 

IV. IMPLEMENTATION 

A. Data Set 

INCOIS is an acronym for Indian National 
Centre for Ocean Information System. It is an 
organization of central ministry department. This 
organization aims to provide a system which has 
the current analysis of ocean characteristics. 

To achieve this it deploys ARGO floating 
sensors in the ocean, which reads the physical 
characteristics like temperature, salinity and 
pressure. The characteristic values are transmitted 
to OIDS (Ocean Information Data System) through 
satellites. Now the data received on OIDS is 
forwarded for the analysis process. In fig. 1 the 
sample input dataset is given. 




IJO 7C 70 EE 



Fig.l Dataset from OIDS 
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This dataset contains the sensor header 
value and data. Here data denotes the values of 
temperature, salinity and pressure. This data is pre- 
processed for the analysis process. Pre-process step 
involves the sensor header value extraction and 
data separation which is explained in Fig.2. 




MySQL 5.5 Corr 



Client 



55:18 0 . G91 76.559 

55:18 0.671 76.557 

55:18 0.691 76.559 

57.799 ! 4050.8 
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35.348 ! 6162.5 

55:18 0.691 76. 559 

62.350 ! 1483.2 

55:18 0.691 76. 559 

55:18 0.691 761559 

35.281 ! 19. 0 




Fig.2 Separated Header and Data 

Data is separated from the set and added 
with precision values in the pre-processing step. 
Also the header extracted from the set is used to 
distinguish the sensors deployed. After the 
separation data is segmented into regions using its 
header value. It is explained in Fig. 3. 





Fig. 5 Support and Confidence metric 

Fig. 6 indicates the graphical 
representation of the time and index value. This 
plot has an average min and max values. When the 
index has its value lower or greater than its average 
min and max value, it is defined that the area might 
have strong waves. 




Fig. 6 Analysis 



Fig. 3 Region based segmentation 

Data with same header values belong to a 
certain region are grouped in this step. Index value 
is estimated after the segmentation of regions and is 
given in Fig. 4. 
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Method to extract the similar pattern is 
depicted in fig.6. Here the value of confidence is 
plotted against the time value. 




Fig. 7 Time Vs Confidence 



Fig .4 Ocean Index Value 
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This graphical representation defines the 
fishing zone of the ocean. Inference obtained from 
the graph is that when the confidence value goes to 
its peak value with respect to time, then the area is 
referred as potential fishing zone. 

V. CONCLUSION 

Text Mining is a mining technique which 
extracts the characteristics of any system. In this 
project, it is proposed to generate a report about the 
ocean system. This report acts as a warning system 
of disasters like tsunami and storm. Also its 
information database contains the location of ores 
and mines under water, which increases the 
economy of our country. Tourism Board and Oil, 
Gas Producers of our country highly depending on 
the ocean for their growth. It helps the fisherman 
by listing the major fishing zones. It also used to 
alert the public at the time of disasters. 
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Abstract — Sequence Alignment process in DNA datasets faces 
different concerns, one of them is the complexity of finding any 
sequence since the data is unstructured and unrelated. Hadoop 
solves some of these issues by dividing the data into many blocks 
and manipulates these data perfectly with high efficient process. 
However, applying Hadoop has to be more accurate because 
DNA still needs more reliable and efficient solution because some 
problems might be not reliable via using Hadoop. In this project, 
I will explain until what extend Hadoop can solve the DNA 
sequence alignment with high degree of reliability. 

Keywords — Cloud Computing, DNA Sequence Alignment, Hadoop, 
MapReduce. 

I. Introduction 

In bioinformatics, many applications need more time 
and a high degree of functional and computational capabilities 
either in hardware or software level to be applied. Hadoop and 
MapReduce solve this part of the capabilities by having a 
cluster and divide the data sets in the cluster, so not only the 
data is divided but the computation also is divided between the 
slaves in the cluster. In Hadoop we have the master node, 
which is the NameNode/JobTracker, and we have many 
slaves, which are DataNodes/TaskTrakers that store the data 
chunks/blocks and doing the computation themselves. 

DNA chromosomes datasets can be considered as a 
big data even it is not very huge data but still unstructured and 
unrelated data. So, with this kind of data any process could 
have some complexities to be achieved with a high degree of 
reliability and efficiency. Hadoop divides the data into many 
blocks and store them on the DataNodes as a virtual file 
system, which is Hadoop Distributed File System HDFS. 
Different tools that support Hadoop to complete its job can be 
used to simplify the process of execution jobs as Zookeeper, 
Hive, Pig and so on [3]. 

Hadoop Distributed File System [Figure 1] allows the 
distribution of the data set into many DataNodes in the cluster 
that can be logically combed for processing. The process of 
distributing data in the cluster happens by selecting the size of 



the block in the Hadoop configurations. So, the user either 
stays with the default size, which is 64MB or changes it into 
128MB [7]. Consequently, the job also is distributed into 
many tasks and each task gets executed on each TaskTracker 
on the local block/blocks. 




Many DataNodes 

Figure 1 : Hadoop MapReduce workflow. 



In this project we will go through the issues that 
using Hadoop to manipulate the DNA data sets that cloud 
happened and we will propose a solution to skip these issues. 
In the second section, we will have a complete description of 
the DNA data format then in the followed section we will 
explain the problem and propose a solution to solve this 
problem. Then we will end up with some future work to have 
an efficient solution using Hadoop in DNA data sets. 

II. DNA Backgroud and Data Format 

To have an efficient solution with any job on any data 
set, you have to understand the format of the data first and see 
if you can control it by having <key, value> on it or not. 
Understanding the data format is the important part of building 
a good MapReduce job. In this part I will explain the critical 
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areas in the DNA data sets that cloud cause a problem or 
weaknesses when execute some jobs on the blocks. 

1) DNA background 

DNA genome sequence consists of 24 chromosomes 
each consists of a huge size of sequence data (getting updated 
during the time) that is represented by an upper case English 
letters pairs. Each chromosome can be divided into many 
genes that are implemented by these pairs of letters. Scientists 
need to find some subsequences within these chromosomes to 
determine either some diseases or proteins frequently. 

Each chromosome has many known genes and many 
unknown sequences. For example, chromosome one consists 
of about 249 million of nucleotide base pairs, which represent 
about %8 of the total DNA in human cells. The total number 
of genes in chromosome 1 is about 4,316 genes each one has 
different length of base pairs. There are about 890 known 
diseases that are related to this chromosome like Alzheimer 
type 4 [2] [4]. From this briefly explanation about chromosome 
1 you can imagine the data size of the DNA. 

2) DNA data format 

The most critical point here is the data format that 
from where the job reads the data. In DNA chromosomes, the 
data is unstructured and unrelated so the job needs to read the 
data carefully. [Figure 2] shows a part of the data that in the 
DNA and its structured [4]. As we can see, there is a line that 
has the metadata of the followed section, which is the 
sequence itself. So, based on the job type you can decide 
either you need to read some from this line or not. In sequence 
alignment example we need to read the name of the 
chromosome. 



The other lines have the sequence of a part of that 
genome which is the letters (A, C, G, T and N somewhere). 
Writing a MapReduce job using some programming languages 
cloud be easy or difficult based on the job itself. 

III. Proplem and Proposed Solution 

1) Problem Definition 

While the most of the sequences in the chromosomes 
are already defined, there are still frequent processes that 
reformulate and define these chromosomes with some updated 
data during the time. So, the process of finding any sequence 
still needed and the result could be different from the previous 
ones. 

Having this type of unstructured data makes the 
process of finding a specific sequence very complicated and 
takes long time. So, with some online-processes the finding 
sequence process will cost the user some expenses with 
classical research techniques that he wouldn't have if he use 
the web-based distributed file system, which is supported by 
some models like Hadoop. 

DNA data sets has a problem of Tow-line sequence 
which means the targeted sequence might be separated 
between two lines in the file. For example, if we have the 
targeted sequence is "GGGGCGATA" we might have it 
divided between two lines. This situation does not give an 
efficient solution because the divided sections will not be 
accounted in the solution. 

The Two-lines sequence problem is shown in the 
sequences that are built in the NCBI project [5]. However, 
there might be another project in future could represent the 



>gi|157811750|ref|NW_001838574.2| Homo sapiens chromosome 1 genomic scaffold, alternate assembly HuRef 
SCAF 1 103279180564, whole genome shotgun sequence 

ATTACATTTTATTCCATTCCATTCCATTCCATTCCAGCACATTTCATTCCATTACATTCCTTTCGAGTCC 

AATCCATTCCATTCCATTCCTATTGAGTCCATTCAATTCCATTCCATACCATTCGAGTCCATTCCATTCC 

ACTCCATTCCATTCCATTCCATTCCATTCCATTCGCGTCCATTTCATTCCATTACATTACATTCCATTCG 

AGTCCATTCCATACATTCCGTTAGACTCGAATCCATTCAATTCCATTCCATTCGCATACATTCCACTCCA 

TTCCATTCGAGTCCATTCCATTCCATTCCATTCCACTCGAGTCCTTTCCATTCCATTCGAGTCCATTCCG 

TTCTATTCCATTCCTTTCCAATCCATTCCTTTTCATACAGTCCATTCCAT 

>gi|157811752|ref|NW_001838574.2| Homo sapiens chromosome 1 genomic scaffold, alternate assembly HuRef 
SCAF 1 103279180564, whole genome shotgun sequence 

TTATTCCATTCCATTCCATTCCATTCCAGCACATTTCATTCAGGACTTCCATTACATTCCTTTCGAGTCC 
AATCCATTCCATTCCATTCCTATTGAGTCCATTCAATTCCATTCCATACCATTCGAGTCCATTCCATTCC 



data somehow but without having this problem. 



20 



http://sites.google.com/site/ijcsis/ 
ISSN 1947-5500 



(IJCSIS) International Journal of Computer Science and Information Security, 
Vol. 12, No. 5, May 2014 



The main objectives behind this project are: 

1 . Speed up the process of finding a specific pattern by 
using Hadoop environment. 

2. Build a MapReduce program to find a given sequence 
and how many this sequence is replicated in the 
specific chromosome or on all chromosomes. 

3. Make the process of finding a specific pattern as 
speed as we can by executing this program in a 
cluster or by one machine. 

2) Proposed Solution 

To solve the two-lines sequence we need to test the 
length of the targeted sequence first and try to match 
whatever we have to find the results. In Hadoop, you can 
determine the format of the results e.g. yes/no result, number 
of matches either in whole DNA or in a specific 
chromosome. Generally, in Hadoop you can determine the 
format of the results based on the format of the data. Some 
data has line number on it, so you can determine the specific 
line that carries the result. Here is a part of the code that 
distributes data into lines then try to find match the sequence 
with the whole line to fine if they matched then add the 
counter by one. 



The above code solves the problem of finding the 
sequence in each line separately but we would solve the 



problem of Two-Lines sequence. So, we need to do some 
processes as follows: 

1 . Find the length of the sequence e.g. length is (SL). 

2. Save the last digits of the sequence from the 
previous line with the length of (SL-1) and put that 
in a temporary string (Tempi). 

3. Save the first digits from the current line with the 
length of (SL-1) and put that in a temporary string 
(Temp2). 

4. Merge Tempi with Temp2 to get a new line 
(Temp3= Tempi + Temp2). 

5. Now, we can apply the finding function on the new 
line (Tmep3). 

6. We have to have this process between each two lines 
that could have Two-Lines sequence is separated 
between them. 

There could be different solutions or additions to this 
solution to have it more efficient like testing (Tempi) if it 
matches the first digits of the sequence then go further with 
this solution. However, we need to do the testing process 
many times each has different length based on the previous 
length. Current length equals to the previous one minus one 
and so forth. 

IV. Current Solution 

In this proposed solution there is some overload work 
that cloud be solved by developing the solution to skips the 
non-beneficial data in DNA. There is a line in the beginning of 
each genome that carries the metadata of that genome, so we 
can skip reading this line but that will produce more work in 
the code to solve a tiny problem. So I preferred to leave it as it. 

In addition, there is one more issue which the 
Unknown parts of the chromosome that have N on them. Also, 
we can consider that as a tiny overload work that we can skip 
or manipulate based on the job itself. 

V. Future Work 

As we mentioned previously, this project was applied 
on the data that we get from the NCBI project that formulated 
the data in files in the format that we explained in section II. 
So, I recommend the future work to solve the problem of Two- 
Lines sequence either by developing the current data files 
using one of the applications or by search and find another 
project that formulates the data with no Two-Lines sequence. 



String line = value.toStringQ; 
String sequence= "GGGGCGGGG"; 
Pattern pattern= Pattern.compile(s); 

FileSplit filesplit= (FileSplit) context. getlnputSplit(); 
Path path= filesplit.getPathQ; 
String filename= path.getName(); 

String chrstring-'"; 

for (String subline : line.split("\\W+")) 
{ 

if (subline.lengthQ > 0) { 

chrstring = chrstring.concat(subline);} 

} 

Matcher matcher= pattern.matcher(chrstring); 
int counter=0; 

while (matcher.find()) counter++; 

context.write(new Text(filename), new IntWritable(counter)); 
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VI. Conclusion 

Finding a sequence in DNA is very important process 
for many scientists fro many areas. However, we cloud have 
this process done by the regular searching ways but it will cost 
time and resources. 

In this paper, we can see the weaknesses in the DNA 
chromosomes data from project NCBI that might cause a non- 
perfect job in the Hadoop Map/Reduce in the environment of 
Cloud Computing. So, by proposing a solution above, we 
could solve the Two-Lines sequence data. In the future work 
section, we described some of points that might help to have 
more efficiency and functionality. 
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Abstract — This paper explains how Scaieform Gfx technology 
can be used to create a beautifully rendered Front-End Menu 
using Flash Professional's proprietary language, ActionScript, 
both in 2D and 3D for video game development. Scaieform Gfx is 
cutting edge technology designed by Autodesk for creating 
stunning user interfaces. 

This paper introduces Scaieform Gfx as videogame 
middleware, reviews some of the previous games' user interfaces 
designed with Scaieform Gfx, provides an iteration of steps that 
can serve as a framework for creating a fully functional front- 
end menu and discusses the limitations of Scaieform Gfx. 

Also, this paper introduces Scaieform CLIK and Scaieform 
3Di and explains how interoperable they are with each other 
when developing a user interface. 

I. Introduction 

Video game development is usually a very tasking process 
because of all the numerous development highlights like 
programming, level design, lighting, rendering, concept 
design and so much more. But more than half of the entire 
development effort usually goes to developing the user 
interface. The user interface may be the Front-End Menu, the 
Heads-Up-Display or even an Inventory Menu. What makes 
the user interface so important? The answer is simple. Without 
the user interface, there is no interaction between the user and 
the game. A user interface is simply the system by which 
people (users) interact with a machine. A video game's user 
interface is the conduit between the player and the game. It is 
all that stands between the user and the game. But developing 
the user interface is not nearly enough. Aesthetics can help to 
improve the overall audio and visual appeal of the interface to 
be used, hence simulating the effect of "gluing the user to the 
screen". That is where Scaieform Gfx comes in. 

Autodesk® Scaieform Gfx® enables developers to 
leverage the power of the Adobe® Flash® tool set to create 
powerful and immersive user interfaces. Scaieform Gfx 
provides a streamlined solution to create hardware -accelerated 
3D game menus, HUDs, animated textures, in-game videos, 
and mini-games and is used in over 1,000 AAA titles across 



all major platforms. It is integrated with some game engines 
like Unreal Engine 3 and 4 and can be installed as a plugin for 
others like Unity 4 and CryEngine 3. As opposed to the batch 
computing era and command-line era where the entire user 
interface was in black and white (MS-DOS), and as opposed 
to other software with user interface development capabilities 
as a side attraction, Scaieform Gfx can be used to create very 
complex, powerful, immersive and detailed interfaces that run 
at smooth 60fps. 

User interfaces usually include the hardware (physical) and 
software (logical) components. Whether or not the designer(s) 
focuses on either or both of the logical or physical 
components, user interfaces must provide a means of: 

Input, allowing the users to manipulate the system 

Output, allowing the system to indicate the effects of 
the users' manipulation 

Hence, the major goal of user interface design and 
development is to improve the interactions between the users 
and computers by making computers more usable and 
receptive to the users' needs. The added boon of Scaieform 
Gfx is that user interfaces developed with Scaieform Gfx can 
do all that while also rendering aesthetically pleasing and 
extremely powerful views. 



II. WHAT IS SCALEFORM GFX? 

Scaieform Gfx is the leading Flash-based user interface (UI) 
middleware for the video-game industry. Scaieform Gfx 
combines the performance of hardware accelerated 3D 
graphics technology with the proven productivity and 
workflow of Adobe Flash tools to help developers rapidly 
create highly immersive 3D UI and casual game experiences 
[8]. Its key features are: 

It provides for anti-aliased vector graphics which 
allow developers to create fully-fledged menus with 
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scrollbars, option steppers, HUDs, and checkboxes 
and so on [2] . 

It provides a high-quality dynamic font system which 
renders fonts into vector images which can be used to 
render text fields with low memory cost [2] . 
It supports rich text support including different 
colours, smileys, and fonts - basically anything that 
can be included in a text field [2]. 
It supports masking for arbitrary shape clipping and 
shape animation. This is important for animations of 
shapes that are not regular like rectangles. An 
example is a heart-shaped animation [2]. 
For artists and designers, especially those that use 
Photoshop and other Image-editing software, 
Scaleform Gfx allows for changing in blend modes 
and filter effects like 'Glow', 'Multiply', and 'Drop 
Shadow' and these can be applied to both text and 
images [2]. 

While Scaleform Gfx is primarily a 2D tool, 
Scaleform Gfx provides extensions for Flash 10 and 
up that allow for 3D translations and transformations 
like tilting on the axes and moving from point A to B. 
These extensions make up Scaleform 3Di [2]. 
Scale9Grid enables developers to resize user 
interfaces easily and solves the problem of cross- 
platform development [2] . 

Scaleform Gfx 4 features a multi-threaded back-end 
design and is integrated with the Unreal Engine 4 
which is also a multi-threaded game engine [7]. 
Multi-threading is the concept of breaking a program 
into two or more parts called threads which run in 
parallel to improve effectiveness, responsiveness and 
performance. 

A. Scaleform CLIK 

The Scaleform Common Lightweight Interface Kit (CLIK) 
allows developers to rapidly prototype efficient front-end 
menu interfaces for games, and then take those prototypes to 
completion by skinning [2]. Skinning is the process of 
applying, adding or replacing the base components of the 
interface with more detailed, production ready components 
without necessarily changing the workflow of the interface. 
The kit contains buttons, checkboxes, scrollbars, radio buttons, 
option steppers and every other component that can be used to 
design a user interface. 



The 3Di extensions are: 

._xrotation: This increases or decreases the rotation 
of an object on the x-axis. 

•_yrotation: This increases or decreases the rotation 
of an object on the y-axis. 

._perspfov: This extension controls the perspective 
distortion of a 3D object. The lower the value, the 
less the distortion. By default, its value is 55 degrees 
but it can be assigned any value between 1 and 179 
degrees. 

_z: This translates an object along the z-axis. The 
lower the value, the closer the object translates to the 
screen. 

_zscale: This alters the scale of any z-axis translation. 
It functions identically to _xscale and _y scale. It 
defaults to 100 or 1:1 ratio. 

_matrix3d: This allows developers to set the 
complete 3D transform of an object using an array of 
16 floating point numbers, making it a 4 x 4 matrix. 
Whenever any of the previously mentioned 
extensions are used on an object, Scaleform Gfx will 
automatically apply a 3D matrix to that object and 
bring it into 3D space. After that, the only way to 
return that object to 2D is to set that matrix to null. It 
is available as a low-level option for programmers 
who want to create some unique transformations. 

Scaleform 3Di does have a few limitations: 

There is no depth-sorting. This means that when 
objects are rotated or translated, they may not be in 
the right order over or under other objects as 
expected. This is because the draw order is 
determined by the layer order in Flash, not their 
positions in 3D space [8]. 

There is no back-face culling. This means when an 
object is rotated, the back side will be drawn. Unless 
this is the effect desired, rotation should be limited to 
90 degrees in either direction [8]. 



III. USER INTERFACES DEVELOPED WITH SCALEFORM GFX 



B. Scaleform 3Di Here is brief list of gaming titles that have used Scaleform 

Gfx in recent years: 

Scaleform 3Di uses hardware accelerated triangles to render starforce Battlement, Batman: Arkham Origins, Killer is Dead, 

shapes. What this means is that it uses the capabilities of the Lost pi anet 2, Dragon Age 2, Crysis 2, The Witcher 2: 

graphics card to render objects in 3D. Scaleform 3Di contains Assassins of Kings, Borderlands 2, DC Universe Online, 

extensions to enable the manipulation of 2D objects in 3D Street Fighter IV, Dragon Age Origins, Mass Effect, Mass 

space. When an object has both 2D and 3D properties, the 3D Effect 2 , Prototype, Fable II, Fable II: Pub Games, 

properties are applied before the 2D properties. This detail is Civilization Revolution, Civilization IV, Civilization IV: 

important to understand to get the expected results when Beyond the Sword, , Civilization IV: Warlords, , Civilization 

rotating an object or translating it [1]. IV: Colonization, Halo Wars, 007: Quantum of Solace, 
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Railroads!, Kung Fu Panda, Viva Pinata: Party Animals, 
Mafia II, SFVT Shots, and Trivium, to name but a few. We 
take a look at a few examples of user interfaces mentioned 
above that are very crucial to this research. 

A. Starforce Battlement 

Starforce Battlement is a tower defence game with role- 
playing game elements. The primary objective of each level is 
to prevent oncoming waves of enemies from reaching their 
destination. Every enemy that reaches the destination subtracts 
one point from the player's life. If enough enemies reach the 
destination, the player's life is reduced to zero and the game is 
over. Starforce Battlement is written entirely in ActionScript 3 
and leverages Scaleform Gfx's 'Shippable Mobile Player' for 
cross -platform deployment. It goes without saying that the 
whole game is like a giant cluster of user interfaces. The Main 
Menu allows the player to select a mission and provides 
additional options for changing the player's technology tree 
and viewing tutorials, the player's statistical history, and the 
player's achievements. The buttons at the top left of the Main 
Menu allow the player to toggle music, toggle sound effects, 
or exit the game. Level selection consists of indicators placed 
on the galaxy/nebula map which the player can select. Each 
nebula contains a set of missions that are unlocked in a linear 
progression. The player can drag the map around in 2D space 
to explore available options before selecting a particular 
mission. When the player selects an available mission, the 
Mission Data View (MissionDataView.as) is displayed. The 
Mission Data View provides the player with details about the 
mission. From this view, the player may either launch the 
mission by selecting the "Enter Mission" button or may return 
to the Mission Select View by selecting the "X" button at the 
top right of the view. 

B. Batman: Arkham Origins 

Developed in 2013, Batman: Arkham Origins is an action- 
adventure video game published by Warner Bros. Studios. 
The user interface features a very sleek menu design as soon 
as the player gets into the game and past the splash screens. 
The Menu features buttons that change their states depending 
on whether the player's cursor is clicking on them, hovering 
over them or not hovering over them at all. In the background 
there is a rather obvious cascade of slides that blur the line 
between image and video. The game's user interface accepts 
input from both a combination of the keyboard and mouse and 
from gamepads. 

C. SFVT Shots (Street Fighter versus Tekken™) 

SFVT Shots is mobile app that displays high resolution 
screen grabs of the popular game Street Fighter Versus 
Tekken. The images displayed can be rotated and tilted on 
every axis (Figure 1). Swipes on the screen allow for the 
interface to display the next or previous image as intended by 



the user and each iteration in the image swiping process 
presents the new image with a different transition. The next 
update for the game will allow images to be dynamically 
saved to the app. The app will be available for free soon on 
the Google Play store. 




Fig. 1: SFVT Shots. 

D. Trivium 

Trivium is a new mobile game still under development (our 
main video game design for this research) for Android. The 
menu prototype was created entirely with Scaleform CLIK 
and 3Di but it has not been skinned yet. Trivium is a trivia 
game for anime fans all over the world and features a dynamic 
score system, a dynamic timer system, an achievement system 
and an Inference Engine. Upon completion, the game will be 
available for purchase on the Google Play store. 



IV. CREATING A SIMPLE FRONT-END MENU WITH SCALEFORM 
GFX 

This section will detail the steps necessary to create a front- 
end game menu for a video game called Trivium. Trivium is 
an indie flash trivia game targeted at anime (Japanese-themed 
cartoon) audiences all over the world. As with Starforce 
Battlement, Trivium is written entirely in ActionScript and 
utilizes both Scaleform CLIK and Scaleform 3Di. Note that all 
the steps taken here are implemented in Flash Professional 
(Fig. 2). 




Fig. 2: Trivium in Flash Professional 



The custom user interface that will be created in this 
document will have the following workflow: 
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Fig. 3: Front-End Menu Work Flow. 



After configuring CLIK and installing the Scaleform 
Launcher in Flash [6], development commences immediately 
in the following steps: 

Draw a sketch of what the Front-End Menu looks 
like (Concept Design - Fig. 3). 

Design all art assets to be used in Photoshop or any 
other Image Editing Software. 

Open the CLIK AS2 file and copy the library. 

Open a new file in Flash. Set the language to 
ActionScript 2. Save the file as Front-End 
Menu_Prototype.fla. 

Paste in the library of CLIK AS 2 into 
Menu_Prototype. 



Import all art 
Menu_Prototype.fla. 



assets 



into Front-End 



Create a new layer called 'Background'. Drag and 
drop the background asset(s) to the scene while 
remaining on the layer according to the concept 
design. 

Create a new layer called 'Main Menu'. Click on the 
'Shapes' tool. Draw a rectangle whose width and 
height are more than the scene's width and height. 
Change its fill property to empty. 

Save the rectangle as 'Rec' of type movieclip by 
dragging it into the library. 



Delete the rectangle on the scene. Drag and drop an 
instance of Rec onto Frame 1 of Man Menu. In the 
instance name field, type in 'menucontainer'. 

Create another layer called 'Actions'. All other layers 
except 'Labels' should be below this layer. 

Double-click on menu-container. Create 7 new layers 
and name them 'Actions 2', 'Startbtn', 'Shopbtn', 
'Aboutbtn', 'Creditsbtn', 'Optionsbtn' and 'Exitbtn'. 
The last layer called 'Layer 1' is where the rectangle 
resides. Rename it as 'Rectangle'. 

Drag and drop one Button to each of the layers 
except Rectangle. No work is going to be done on 
Rectangle. It simply acts as the visualization of the 
placeholder of the menu. In the instance name field, 
type in 'startbtn', 'shopbtn', 'aboutbtn', 'creditsbtn', 
'optionsbtn' and 'exitbtn'. 

Create 7 new layers. Name them 'Start Menu', 'Shop 
Menu', 'About Menu', 'Credits Menu', 'Options 
Menu', 'Exit Menu'. Go to Frame 2 of Start Menu. 
Insert a button with instance name 'backbtn. Insert 
backbtn on Frame 3 of Shop Menu, Frame 4 of 
About Menu, Frame 5 of Credits Menu, Frame 6 of 
Options Menu and Frame 7 of Exit Menu. 

To use Scaleform 3Di to apply transformations and 
translations to our buttons, click on Frame 1 of 
Actions 2 and type in: 

//To translate each button along the z-axis so each 
one appears offset to another 

startbtn._z=-50; 

shopbtn._z=130; 

aboutbtn._z=79; 

creditsbtn._z=32; 

optionsbtn._z=150; 

exitbtn._z=100; 
//To change the cursor button to the hand button 
when the cursor is on the buttons and to enable 
mouse clicks on the buttons 

startbtn.buttonMode= true; 

shopbtn.buttonMode=true; 

aboutbtn.buttonMode=true; 

creditsbtn.buttonMode=true; 

optionsbtn.btnMode=true; 

exitbtn.btnMode=true; 
Do the same for all the buttons called 'backbtn'. 

To go to the respective sub menus when each button 
is clicked on: 

startbtn. addEventListener(MouseEvent.CLI 
CK, startmenu); 

function startmenu(e vent : MouseE vent) : void 
{ 
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gotoAndStop(2); } 

shopbtn.addEventListener(MouseEvent.CLI 
CK, shopmenu); 

function shopmenu(event:MouseE vent): void 
{ 

gotoAndStop(3); } 

aboutbtn.addEventListener(MouseEvent.CL 

ICK, aboutmenu); 

function 

aboutmenu(e vent: MouseE vent) : void { 
gotoAndStop(4); } 

creditsbtn.addEventListener(MouseEvent.C 
LICK, creditsmenu); 

function exitmenu(event:MouseEvent):void 
{ 

gotoAndStop(5); } 

optionsbtn.addEventListener(MouseEvent.C 

LICK,optionsmenu); 

function 

optionsmenu(e vent: MouseE vent) : void { 
gotoAndStop(6); } 

exitbtn.addEventListener(MouseEvent.CLI 

CK, exitmenu); 

function 

exitmenu(event:MouseE vent): void { 
gotoAndStop(7); } 

To go back to the Main Menu when any 'backbtn' is 
clicked on, type this on each frame of the Actions 
layer where backbtn resides: 

backbtn.addEventListener(MouseEvent.CLI 
CK, gobacktomainmenu); 
function 

gobacktomainmenu(event:MouseE vent): void { 
gotoAndStop(l); } 

Go back to the original scene and click on Actions. 
Type in: 

//Create four variables xvar, yvar, xhold, yhold that 
hold four different values. 

/* Functions rotatex and rotatey rotate the movieclip 
menu_container. Both functions are called every 
frame by the function 'beginidlerotateloop'*/ 

var xvar:Number=0.030 

var yvar:Number=0.035 

var xhold:Number=10; 

var hold:Number=15; 

function rotatex(mc :MovieClip) : void 

{ 

/*set menucontainer's rotation on the x- 
axisto be equal to its rotation plus xvar 
every frame */ 

mc._xrotation= mc._xrotation + 

xvar; 

/*if menu container's rotation value is 
greater than xhold or less than negative 



xhold, change xvar to negative xvar, thereby 
rotating in the opposite direction */ 

if (mc._xrotation>xhold II 
mc._xrotation< -xhold) 

{ 

xvar= -xvar 

} 

} 

function rotatey(mc:MovieClip):void 
{ 

/*set menu container's rotation on the y- 
axisto be equal to its rotation plus yvar 
every frame */ 

mc._yrotation= mc._yrotation + 

yvar; 

/*if menu_container's rotation value is 
greater than yhold or less than negative 
yhold, change yvar to negative yvar, thereby 
rotating in the opposite direction */ 

if (mc._yrotation>yhold II 
mc._yrotation< -yhold) 

{ 

yvar= -yvar 

} 

} 

//Call functions rotatex and rotatey every 
frame 

function beginidlerotateloop():void { 

onEnterFrame = function() { 

rotatex(menu_container); 

rotatey(menu_container); 

} 

} 

To change the Main Menu's field of view as it 
rotates, type in: 

//change menu container's field of view to 
70 degrees 

menu_container._perspfov=70; 




Figure 4: Trivium using Scaleform Gfx. 
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In the 20 steps iterated above, Scaleform 3Di and 
Scaleform CLIK were used to create a front-end menu 
prototype that tilts back and forth on both the x-axis and y- 
axis with fully functioning clickable buttons that can go to 
each individual sub-menu and go back to the main menu when 
the back button is pressed. 

This simple framework can be used to design user interfaces 
that are so complex with so many nested movieclips, while 
running smoothly at 60fps. 

V. Conclusion 



[5] Autodesk (2014): "Scaleform Gfx CLIK User Guide AS3". 
Scaleform Gfx SDK Documentation. Page 1 - 40. USA. 

[6] Autodesk (2014):"sf_4.2_getting_started_with_CLIK". 
Scaleform Gfx SDK Documentation. Page 1-41. USA 

[7] Kris Graft (2010): "Scaleform Gfx GFx 4 Supports Flash 
10 AS3". Gamasutra. Page 1. USA. 

[8] Mathew Doyle (2010): "Using 3Di Flash AS2 Extensions". 
Scaleform Gfx Video Tutorial. USA. 



This paper presented one possible way to use Scaleform 
CLIK and Scaleform 3Di. There are many permutations, and 
most projects will present unique challenges. The framework 
given above in the form of steps enables developers to design 
and develop fully fledged user interfaces, particularly the 
Front-End Menu. However, it can also be used for developing 
Heads-Up-Displays as only a few changes need be made. For 
example, the button system might be removed altogether and 
replaced with screens, boxes and bars to display the user's 
health points, status, or location on the map. The common 
denominator, however, is that Scaleform Gfx is a really 
powerful tool that can be used to create really complex user 
interfaces in minutes, not days and should be utilized by every 
user interface designer in the video game industry. 



VI. RECCOMENDATIONS 



As great as Scaleform Gfx is, it could still be improved. 
Scaleform 4 would be expected to support multi-threading as 
mentioned in Section 2.1 and also be integrated into game 
engines like Unreal Engine 3, CryEngine 3 and Unity 4, 
instead of having to download plugins. Also, depth-sorting 
and back-face culling problems limit the options of 
imaginative developers and should be remedied. 
Finally, as mentioned above, Scalefrom Gfx is recommended 
for any user interface designer in the market as it is indeed the 
leading Flash-based user interface development middleware 
today. 



References 

[1] Autodesk (2010): "Intro to Scaleform for UI". Scaleform 
Gfx Video Tutorial. USA. 

[2] Autodesk (2010): "3Di Overview". Scaleform Gfx SDK 
Documentation. Page 1 - 40. USA. 

[3] Autodesk (2010): "Autodesk & Scaleform Gfx". 
Scaleform Gfx SDK Documentation Page 1- 30.. USAA 

[4] Autodesk (2010): "Mobile Game Kit Overview". 
Scaleform Gfx SDK Documentation. Page 1 - 35. USA. 



28 



http://sites.google.com/site/ijcsis/ 
ISSN 1947-5500 



(IJCSIS) International Journal of Computer Science and Information Security, 

Vol. 12, No. 5, May 2014 



Bovines Muzzle Identification Using Box-Counting 



Hazem M. El-Bakry 



Ibrahim El-Hennawy 



Hagar M. El Hadad 



Department of Information Systems, 
Faculty of Computer and Information 
Sciences, 

Mansoura University, Mansoura, Egypt 



Faculty of Computer Science and 
Information , 
Beni-Suef University 
Beni-Suef, Egypt 



Faculty of Computer Science and 
Information, 
Beni-Suef University 
Beni-Suef, Egypt 



Abstract — Bovines identification has become widely used as 
essential for guarantee the safety of cattle products and assists 
veterinary disease supervision and control. Texture feature 
extraction is a key step for muzzle image processing. In this paper, 
we focus on bovines muzzle patterns identification as a biological 
texture using a method for feature extraction of Muzzle images. The 
proposed method has been implemented by using Box-counting 
Fractal Dimension. Before texture feature extraction, preprocessing 
operations such as histogram equalization and morphological 
filtering (opening and closing) have been used for increasing the 
contrast and remove noise of the image. After that, fractal 
dimension is calculated as the texture feature. The experimental 
results show that feature vector for different image of the same 
muzzle are highly symmetry. Therefore, it can be applied in 
registration of bovines for breeding and marketing systems. 

Keywords-component; bovine 's identification, image processing, 
fractal dimension, feature extraction, Box-counting 

I. INTRODUCTION 

Nowadays Veterinarians do great effort in the livestock 
field. Bovines identification is used for exhibition of bovines, 
breeding and marketing. Early bovines identification method 
such as ink printing method has disadvantage that is the good 
muzzle print take time reach up to 6 minutes [1] and the difficulty 
in obtaining. Also ink print method problems are not holding the 
animal still, the use of too much ink, a build-up of moisture on 
the animal's nose and result in unreadable print [2]. 

In humans we use fingerprint as biometric identifier and 
in bovines hair covered animal skin except some parts such as 
muzzle. The distribution of ridges and valleys in each cattle 
muzzle is responsible for forming bovine's pattern. Because of 
the muzzle consistent over time so it use as like fingerprints of 
human to form biometric identifier. Bovines pattern is hereditable 
and asymmetry between each half is meaningful [3]. Now we 
captured a life image of cattle and do some image enhancing and 
feature extraction methods to automatically identify muzzle 
patters than ink printing method then we use mathematical 
morphology filtering in order to use it in bovine's identification 
[4]- 

Lot of improves has been done of texture feature 
extraction than past 50 years. Texture feature extraction is a key 
fundamental in image classification and segmentation [5]. First 
applying histogram equalization on image that produce a gray 



scale which makes all pixels values closed as possible to contrast 
adjustment and enhancement. Then by using Mathematical 
morphology filtering noise in the image can be removed 
effectively. So after histogram equalization the low contrast 
image transformed into high contrast images and noise removed 
after applying opening and closing operations on the image. 

The fractal dimension is calculated as the texture feature 
of image. The fractal dimension of an image can reflect the image 
feature, so some measures are taken to have the direction details 
of the image to better express the important texture feature 
information in Muzzle image. 

The rest of the paper is organized as follows. Image Pre- 
processing used in this paper is described in Section II. Feature 
Extraction is detailed introduced in Section III. In Section IV 
Experimental result is presented. Conclusion and Future Work 
are reported in section V. 

II. Image Pre-processing 

Image pre-processing is applied at the lowest level of 
elimination and its aim is to reduce undesired noises and enhance 
the image data which is important for further processing [6]. It is 
required for improving the performance of image processing 
methods such as image segmentation and image feature 
extraction [7] [8]. Image pre-processing operations such as 
histogram equalization (HEQ) to increase the contrast of each 
image and mathematical morphology filtering (MM) to remove 
image noise must do before texture feature extraction [23-46]. 

A. Histogram equalization (HEQ) 

First, histogram is the probability distribution of a particular 
type of data. One type of histogram is image histogram which 
gives us graphical representation of the distribution of the gray 
values in a digital image. Image's histogram analyzes the 
frequency of appearance of the different gray levels contained in 
the image. Histogram equalization spreads out intensity values 
along the total range of values in order to achieve higher contrast. 
This algorithm is useful in image which is represented by close 
contrast values, such as images in which both the background and 
foreground are bright at the same time, or else both are dark at the 
same time [9]. 



Histogram equalization method is implemented as shown in 
the following flow chart that outlining the following steps. 
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Consider a digital image with gray levels in the range [ 0, 
L— 1], Probability Distribution Function of the image can be 
computed as equation (1): 



r^> = -^ h = h ,£--1 



Where r k is the k th gray level and n k is the number of pixels in 
the image having gray level r k . 

• Cumulative Distribution Function (CDF) can also be 
computed as followed: 



K = 0, L-l , 0< C( 7 < l 



Histogram Equalization (HE) appropriates gray level S^ 
to gray level r k of the input image using equation (2). So 
we have: 



<3? 



Gray level S k 's changes can be computed in usual 
histogram equalization method: 



Equation (4) means that distance between Sk and Sk +1 has 
direct relation with PDF of the input image at gray level r^ [10]. 

B. Mathematical morphology filtering (MM) 

Morphology means the study of shape. Mathematical 
morphology describes shapes by using set theory, lattice theory, 
topology, and random functions depending on mathematical 
theory. In image processing, mathematical morphology used to 
determine the interaction between image by using some operations 
such as erosion, dilation, closing and opening. Here in this paper 
first we do opening operation on the image followed by do closing 
operation on the image in order to remove noise points, fill the 
edge of some pixels, remove small lines and connect thin break 
areas in muzzle image [1 1]. 

• Dilation and erosion operations are not inverse operators. If 
X is eroded by B and then dilated by B, one may end up with 
a smaller set than the original set X. This set, denoted by X o 
B, is called the opening of X by B defined by X o B 

— j/L* £j) B. Likewise the closing of X by B is the 
dilation of X followed by the erosion, both with the same 
structuring element. The closing of X by B may return a set 
which is larger than X; it is denoted by X • B and defined by 

X • B = (X I) ©B. 

• Dilations and erosions are closely related. This is expressed 
in the principle of duality [12] that states that 



x i£= 0 <$$y 



IT 



Where the complement of X, denoted , is defined as 
^ c ■ (i? i £ |j? 3 , and the symmetric or transposed set of B 

^ is the set defined as $ ■ {— £>|i> i. Therefore all 
statements concerning erosions and openings have a parallel 
statement for dilations and closings, and vice versa [13]. 

• The opening of A by B is obtained by the erosion of A by B, 
followed by dilation of the resulting image by B: 

A o B = (A © 

• The closing of A by B is obtained by the dilation of A by B, 
followed by erosion of the resulting structure by B: 



A • B 
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III. Feature Extraction 



Image feature extraction is very important in many image 
processing applications such as classification and recognition. In 
this paper we use box counting algorithm feature extraction 
methods. This algorithm is implemented for texture analysis of 
muzzle database. 

A. Texture feature extraction using box counting method:- 

Texture feature extraction is the second step after preprocessing 
operations. Some transforms performed in the resulting image 
(closed image). There are found several methods to calculate 
fractal dimension of muzzle image, but a lot of studies show that 
box counting algorithm is common used in fractal dimensions 
calculations [14]. Gang pain and Roques-Carm advanced this 
method [15]. Sarker and chaudhuri upgrade this algorithm to 
differential box counting [16][17][18][19]. 

Fractal appears in image application such as analysis, 
segmentation, classification pattern recognitions etc. [20]. Fractal 
dimension (FD) is valuable feature in texture segmentation and 
muzzle image classification. Box counting technique frequently 
used to estimate fractal dimension of muzzle images. The box 
counting algorithm is useful for determining fractal properties of 
ID segment, a 2D muzzle image or a 3D array. Box counting 
method is the most frequently used and more popular algorithm. 
Here the box counting algorithm steps. The box counting 
dimension associated with the concept of self-similarity where a 
muzzle image sub-divided into smaller elements and each small 
part replace the original muzzle image. Box counting dimension 
algorithm Db of any bounded subset of A in R n, which is a set in 
Euclidean space. Let Nr(A) be the smallest number of the set of r 
that cover A. Then 
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From equation (8) it is possible to make an analogy to the 
equation of a straight line, y = mx ± c, where m is the slope of 
the line and c is the y intersect. The box-counting dimension is 
implemented by placing a bounded set A, in the form of a muzzle 
image, on to a grid formed from boxes of size r X r . Grid boxes 
containing some of the structure, which in the case of a muzzle 
image is represented by the grey-levels within a certain range, are 
next counted. The total number of boxes in the grid that contain 
some of the structure is N (A) r. The algorithm continues by 
altering r to progressively smaller sizes and counting N„ . The 
slope of the line fitted through the plot of log (1/ r) against 
log JWjtiis the fractal, or box-counting, dimension of the 
bovine muzzle image region under investigation 



IV. Experimental result 

A. Database 

Muzzle database is the first challenge that we face when we 
start this research because of insufficiency muzzle printed 
database. Our muzzle database consists of 53 different cattle 
muzzle, each cattle has twenty captured image, this database for a 
real time cattle which collected by Dr. Hamdi Mahmoud. A 
sample of four different images to three muzzle captured image is 
shown in the following figure. 




Dl, 



Provided that the limit exists. 



(5) 



Subdividing R" into a lattice of grid size rxr where r is 
continually reduced, it follows that N' r (A) is the number of grid 
elements that intersect A and D\, is, 



D b 



(6) 



Provided that the limit exists. This implies that the box counting 
dimension Db and Nr(A) are related by the following 
po wer la w rela tion, 




A sample of muzzle printed images from live animals. This 
figure represented muzzle print images have been taken from 
three different animals. 

B. Histogram equalization 

Histogram equalization widely employed as enhancing algorithm. 
We can see that the image's contrast has been improved. The 
original histogram has been stretched along the full range of gray 
values, as we can see in the histogram equalization in Fig. 1 . 



CO 



Proof of this relation can be obtained by taking logs of both sides 
of equation (7) and rearranging to form equation (8), 



(8) 
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histogram of the gray scale image 




□ 1 □□ 200 



histogram equalization image histogram of the equalization image 




□ 1 □□ 200 



Fig. l. muzzle image before and after histogram equalization 

By compare histogram to three different images to the same 
bovines we found that histogram is symmetry as in the fig. 2. 



histogram ofthe equalization image histogram ofthe equalization image histogram ofthe equalization image 
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image (aj image (b) image (c) 

Fig. 2. Three different image to the same bovine muzzle 
By compare histogram to another three different image to 
the same bovines we found that histogram is symmetry as in the 
fig-3. 



histogram ofthe equalization image histogram ofthe equalization image 




O 1O0 2O0 O 10O 200 



image (a) image (b) 



histogram ofthe equalization image histogram ofthe equalization image 
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Fig. 3. Four different image to the same bovine muzzle 

The histogram to different muzzle is dissimilar as in fig. 4. 



histogram of the equalization image histogram of the equalization image 




image (a):- histogram to first muzzle image (b):- histogram to second muzzle 

Fig. 4. histogram to two diferent muzzle image 

In image processing the histogram equalization is the 
process which shows the appearance of each intensity value in 
muzzle image. Histogram graph show number of pixels at each 
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different intensity value. For example if the image has 9-bit gray 
scale this mean that there are found 512 different intensities 
value, so the histogram graph show 512 numbers which show 
pixels distribution among each gray scale values [21]. Histogram 
equalization increase image contrast because it specify the 
intensity value of the input image pixels, so histogram aim is that 
the output graph contain a uniform distribution of intensities. 
Histogram equalization method increases global image contrast 
[22]. 

C. Mathematical morphology filtering (MM) 

Opening operation of A by B obtained by erosion of A by B then 
dilation ofthe resulting muzzle image by B. Return to fig. 5 that's 
contain the result image to muzzle after opening operation. 




Fig. 5. opening operation on different muzzle 
The output muzzle image that is in fig. 5. is used as input image 

in close operation. Closing operation of A by B obtained by 
dilation of A by B then erosion ofthe resulting muzzle image by 
B. The result image illustrated in fig. 6. 




Fig. 6. close operation for two different muzzle 



After image preprocessing operations histogram equalization 
(HQ) and mathematical morphology filtering (MM) low contrast 
muzzle transformed to high contrast and noise removed 
respectively. Now the muzzle image become ready to the second 
step that is texture feature extraction. 

D. Texture feature extraction using box counting method 

The implementation of box-counting in different image to the 
same bovine result to the same result return to fig. 7. That show 
2D box- counting to four different image to on bovine muzzle. 
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Hfc. 7. box-eounting to four different Image to the same bovien 

Fig. 8. Show another result to different bovine muzzle 

2D box-count 



2D box-count 




r. box size 

Fig. 8. box-counting to three different image to the same bovien 

Previous after implementing box-counting algorithm on 
different muzzle image and each muzzle has different image the 
experimental result show the high similarity between the different 
image to the same bovine muzzle as in the following chart. 

muzzle feature extraced using box -counting 
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The fractal dimension to four images to one bovine which has 
histogram appears in fig. 3. And box counting result in fig. 7 is 
appr oximately equal. 



Fractal dimension, FD = 


1.6827 +/- 0.32645 


Fractal dimension, FD = 


1.7086 +/- 0.30032 


Fractal dimension, FD = 


1.7132 +/- 0.30303 


Fractal dimension, FD = 


1.7086 +/- 0.31821 



The fractal dimension to three image to another one bovine which 
has histogram appear in fig. 4. And box counting result in fig. 8 is 
approximately equal. 



Fractal dimension, FD = 


1.8424 +/- 0.13487 


Fractal dimension, FD = 


1.8477 +/- 0.17476 


Fractal dimension, FD = 


1.8289 +/- 0.15011 



Another three different image to the same bovine. 



So we can use this result in muzzle image classification and 
identification. 



V. Conclusion AND Future Work 

This paper has presented feature extraction to database 
consists of muzzle image. First, the statistical representation 
preprocessing techniques (Histogram equalization and 
mathematical morphology filtering) have been used in order to 
increase the contrast of each image and remove image noise. 
Then, box-counting algorithm has been used to extract the feature 
vector to each muzzle case. Simulation results have shown that 
the statistical representations for different images of the same 
muzzle are correct. The disadvantage of the statistical 
representation is that in case of lager amount of data (image), we 
need more intelligent techniques to give us accurate results. So, 
in the future work we may use big data algorithms together with 
intelligent soft comporting techniques. 
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ABSTRACT 



Energy meters in Nigeria have dominantly been electromechanical in nature but are gradually being 
replaced by more sophisticated and accurate digital and electronic meters. Today, a high percentage of 
electricity revenue is lost to power theft, incorrect meter reading and billing, and reluctance of consumers 
towards paying electricity bills on time based on postpaid meter. Considerable amount of revenue losses 
can be reduced by using Prepaid Energy Meters. A prepaid energy meter enables power utilities to collect 
energy bills from the consumers prior to the usage of power by delivering only as much as what has been 
paid for. This research provides a prepaid energy meter behaving like a prepaid mobile phone. The meter 
contains a prepaid card similar to mobile SIM card. The prepaid card communicates with the power 
utility using mobile communication infrastructure. Once the prepaid card is out of balance, the consumer 
load is disconnected from the utility supply by the contactor. The power utility can recharge the prepaid 
card remotely through mobile communication based on customer requests or consumer purchasing 
recharge card. A prior billing is bound to do away with the problems of unpaid bills and human error in 
meter readings, thereby ensuring justified revenue for the utility. Over the past several years, smart cards 
have achieved a growing acceptance as a powerful tool for security, identification, and authorization. The 
increasing computational power placed on the chip along with advances in cryptography has made the 
smart card a very powerful tool for identification. The advent of multi-application smart card operating 
systems for both contact and contact less applications has put smart cards on the edge of information 
technology. The proposed system uses an IP-based controller for the prepaid meter and the load meter 
and the responsibility of Load meter is to provide a simple way of detecting electricity power theft 
without any human intervention. The Load meter would indicate exact building or location and 
distribution line on which unauthorized taping is done in real time. It would be time saving if distribution 
company personnel take reading by this wireless technique and also it would provide a digital record in 
case of any judicial dispute which will be use for comparative analysis between the prepaid meter. The 
idea is to maximize the profit margin of power utility company, efficient online control of the total 
amount of electricity consumed in a specific location and be able to detect when there is bypass by the 
user either by shoot- hunting without connecting the cable through the digital meter or parts of the 
equipment are connected through to the smart meter why high voltage equipment are bypassed. 

KEYWORDS: Prepaid Meter, IP-Based Controller, Load Meter. 
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INTRODUCTION 



The Generation, Transmission and Distribution (T&D) as well as supply of electricity involve huge 
operational losses. The magnitude of these losses is rising at an alarming rate in several countries. In 
order to identify illegal consumers of electricity in the view of enhancing the economy of utilities, 
efficiency and security of the grid, a new method of analyzing electricity consumption patterns of 
customers and identifying illegal consumers is proposed and realized. Nigeria electric power network 
operator, Power Holding Company of Nigeria (PHCN) has over the years being faced with the problem 
of revenue collection. This is mostly because electricity bills are sent to consumers after it has been 
consumed. Consumers are reluctant to pay electricity bills due to estimated bills and unreliable and 
irregularity power supply. The low reliability of electric power supplies has little or no impact on the 
network operator because whether there is power or not, the normal estimated monthly electricity bills 
are sent to consumers in the post-paid method. Therefore, the consumers suffer the cost of generating 
power for their individual usage and the cost of electricity that was never supplied by PHCN. Due to the 
huge debt owed by customers, the network operator introduced a cash collection policy called Revenue 
Cycle Management (RCM) that involves using private companies in the collection of monies owed. This 
seems not to yield the expected results; hence PHCN introduced the digital pre-paid meter in 2006 which 
operation is similar to the loading of recharge card in the Global System for Mobile communication 
handset. If power is available and the pre-paid meter is loaded with units, the loaded units decreases only 
when load is connected and stops when power fails. 

In the last decade, smart cards evolved from basic memory cards to complex systems on chips with 
expanding processing power. This has opened the avenue to many applications such as financial 
transactions, e-commerce, physical access control, health, and transportation services. The smart card, an 
intelligent token, is a credit card sized plastic card embedded with an integrated circuit chip. It provides 
not only memory capacity, but computational capability as well. A smart card usually consists of a ROM 
or flash memory, EEPROM and a CPU. Access to data stored on the card is under the control of the 
smart card operating system. The card operating system not only makes the smart card secure for access 
control, but can also store a private key for a public key infrastructure system. Lately, the industry has 
come up with 32-bit smart card processors having more than 400Kbytes of EEPROM, and a memory 
management and protection unit serving as a hardware firewall. This hardware firewall enables secure 
separation of adjacent applications, as well as being the basis for secure downloading of applications. The 
self-containment of smart card makes it resistant to attack as it does not need to depend upon potentially 
vulnerable external resources. Because of this characteristic, smart cards are often used in several 
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applications which require strong security protection and authentication. In addition to information 
security, smart cards achieve greater physical security of services and equipment, because a smart card 
restricts access to all but authorized users. Furthermore, the smart card can be used as a credit/debit bank 
card which allows it to be used effectively in e-commerce applications. The multi-application smart card, 
along with the advent of open platform smart card operating systems, brings the only realistic option for 
managing multiple electronic transactions nowadays. It is a cost effective secure way to manage 
transactions electronically Manufacturers, issuers and users have recognized the value of one card that 
handles multi-applications. A multi-application card will be able to automatically update new services 
and existing applications, change and store user profiles for each application and be accepted by a range 
of devices-PC, POS, mobile phones. One of the most valuable applications is in using the smart card to 
buy energy. Domestic consumers could for instance buy energy, at a price based on their previous 
consumption pattern, from any supplier wherever and whenever they choose. When the customer wants 
to top up their electricity credit they visit a vending machine which uses the consumption data stored on 
their card to allocate a tariff and calculates how much energy to offer the consumer for their money. 
Recently, the portal technology has been playing an increasing role in computing. Service providers are 
rolling out portals to allow users to create customized web sites that display exactly the information on 
the Card and transformer. Corporations are rolling out portals to provide employees and business 
partner's customizable access to corporate information, including new feeds from external providers, or 
email, calendar and access to billing system, in addition to other web services. For web enabled energy 
services, and with the advent of home networking technology, power companies and service providers 
can provide value-added services delivered to the homes, like energy management, to generate additional 
revenue as well as to increase convenience and loyalty. In this research work, we propose a novel and 
simple prototype of a web enabled smart card based solution for controlling the consumption of 
electricity in a home environment, system that can calculate the total voltage consumption and the 
structure health condition of the transformer as well as the total voltage distributed by the transformer. 
Since the last decades of the past century, scientists, researchers and public people have been worried 
about energy conservation. People spend much more power than what they actually need and that results 
in a huge loss of energy. Moreover, the continuous increase in the universal energy prices has resulted in 
a huge economical loss. Thus we are proposing a prepaid electricity smart card based system so people 
can buy specific amount of energy to use it only when they need. People can register for this service and 
charge their accounts through the Internet. The proposed system is based on an IP-based controller called 
TINY, and a WATTNODE type power meter which interrupts the controller at a regular interval based 
on the consumption of electricity to update the balance based on a certain tariff. The power meter we 
used, interrupts the controller at a rate of 0.75Wph, so based on the particular tariff used and the amount 
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of power consumption needed, the correct amount of money to be loaded into the card can be easily 
calculated and programmed into the chip. The unique feature about this system is that the electric utility 
in the home environment can be accessed remotely from the supplier server due to the fact that the 
controller is IP -based, without the need for a PC on site, which reduces the cost of the system drastically. 
People now can buy electricity in advance, using the so-called prepaid electricity cards. The proposed 
prepaid smart card can also be used to manage electricity consumption in a hotel room, as well as 
accessing the room itself. Thus, people can consume only as much power as they really need. The main 
role of the smart card can be summarized as: 

• Authenticating the user or log in 

• Updating the balance in the card based on the given tariff and the electricity consumption profile 
of the user stored in the smart card. See Figure below for the proposed system. 

• Automatic respond on customer report on any bypass. 




FIG 1 . Flow Chart for Prepaid Electricity 
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FIG. 2 Stage of Activities in prepaid Card. 




FIG 3. Conceptual diagram of Pole based system 
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STATEMENT OF THE PROBLEM 



Electric energy meters, the direct billing interface between utilities and consumers for long, have 
undergone several advancements in the last decade. The conventional electromechanical meters are being 
replaced by new electronic meters to improve accuracy in meter reading. Still, the Nigeria power sector 
faces a serious problem of low revenue collection for the actual electric energy supplied owing to energy 



40 



http://sites.google.com/site/ijcsis/ 
ISSN 1947-5500 



thefts through bypass from digital meter. One of the prime reasons is the traditional billing system which 
is inaccurate many times, slow, costly, and lack in flexibility as well as reliability. Therefore, attempts are 
being made to automate the billing systems, even though more accurate and faster meter readings have 
seen the light of day; bill payment is still based on an old procedure. They require an individual/agent to 
personally come down to customer place and note the meter readings and report the amount one has to 
pay to the household/office. But the demand for computing power at all levels of electronic systems is 
driving advancements in semiconductor chip technology. The AMR and power quality monitoring 
systems manufacturers are taking advantage of these advances and integrating them into new meters and 
instruments. The networking technologies are driven by the demand for interconnection of computer 
users worldwide. The AMR and power monitoring systems are using these advances to expand the 
monitoring systems. A Prepaid Energy Meter enables power utilities to collect electricity bills from the 
consumers prior to its consumption. The prepaid meter is not only limited to Automated Meter Reading 
[AMR] but is also attributed with prepaid recharging ability and information exchange with the utilities 
pertaining to customer's consumption details. But, there primary objective were not met due to energy 
theft by bypassing a digital prepayment meter, which is an economic lost to the Government. 

AIM OF THE RESEARCH 

A secure smart card based system for e-payment, implemented on prepaid electricity over the internet, 
was proposed. The smart card system has been designed and implemented successfully using a three tier 
model client-server system. The proposed system has the benefit of using a secure smart card to log in to 
the network, and control the amount of money needed to be spent for the required electricity consumption 
based on the user profile stored on the card and also to give an automatic report on an bypass of voltage 
through the means of shoot hunting or short circuited without the user connecting through the smart card 
or quick Load meter system design to transmit total voltage consume within a define specified location. 
The proposed system also has the unique feature of using an IP-based controller which provides remote 
access to prepaid meter and Load meter information within that particular location in other to ensure 
total consumption which link to the server without any additional cost. 

OBJECTIVE OF THE RESEARCH 

• To explore and investigate various electricity payment system 

• To propose a model for IP Based smart card payment system. 

• To evaluate the proposed Model as a solution to bypassing 

• To design and implement these model as a Real time processing in solving the problem of 
bypassing. 
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• To implement a real time Load meter system use to transmit total voltage consume within a 
particular location 

RESEARCH QUESTION 

• Are these various methods of electricity billing system prevented fraud? 

• Has the introduction of pre-paid meters aided revenue generation in Nigeria? 

• What is the best method in solving the problem of bypassing? 

• What is the means of identification of total voltage availability in the building? 

• How can bypass be detected? 

REVIEW OF RELATED LITERATURE. 
Introduction 

As deregulation takes hold, customer competition and care is gathering pace as the wave in the utility 
industry. And central to this, linking utility and customer, is the meter, whose role is becoming 
increasingly important with impact on all areas of the metering business from metering practice and 
metering technology to billing. 

Metering according to Simpson (1996:14) is the process and methods of utilizing devices to measure the 
amount and direction of electrical energy/flow, particularly for end-use. " he also defined metering as " 
installation of equipment that makes it possible for a utility to determine the amount of electric power a 
particular customer has consumed. " electricity is provided to customers by wires, often called service 
drops, emerging from distribution transformers. These wires go into the electric meters that measure the 
quantity of electricity used (measured in kilowatt-hours.) 

Meter 

According to Austin (2002). "The meter is both a means of measuring energy supply and also a critical 
sales and marketing tool, which has a correlation with company's profitability." Meters are for correct 
measurement of electricity to ensure customers pay for their consumption while enabling utilities to 
charge based on what has been supplied. According to Leitner (1998:231 ), customers need value for the 
money spent in installing the meters; therefore, the meter which gives value for the money must be 
functioning properly at all times. The meter is typically located where the utility hands off the delivery of 
electricity to the customer. Generally the customer is responsible for purchasing and maintenance 
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equipment past this point. Thus there is need for proper management and maintenance of the meters for 
effectiveness and efficiency in the prepayment Metering System. Utility business and operational 
services should no longer be segmented- for example, linking energy management tools (such as load 
profiling, monitoring and control and demand response management) to a metering and customer 
information system (CIS) will encourage customer loyalty, reduce demand and assist in managing costs" 
(Metering Billing and CRM/CIS), 17-19 February 2004, Kuala Lupur, Malaysia. 

Metering as a System 

The meter is key tool in any utility requiring efficient operation and maintenance as it links the utility 
with customers (Kettless 2004:56). Thus, the General System Theory (Chadwick 1978) has been adopted 
as the conceptual framework for the study. A prepayment Metering System has viewed as a system with 
many interrelated sub-systems, the General System Theory is one major theory at the root modern 
scientific approach to management Chadwick (1978:36) defines a system as " a set of objects together 
with relationships between the objects and between their attributes," while Cole (1995) looks at it as an 
interrelated set of activities which enables inputs to be converted into outputs. Systems may be closed or 
open. Closed systems are those which for practical purpose, are completely self-regulating and thus not 
interact with their environment. Therefore, in the context of this study, a prepayment metering system 
cannot be closed as it relies on inputs from the community where it is stimulated for its survival. In this 
regard, Cole (1995:70) defines an open system as that which interacts with its community, on which it 
relies for obtaining essential inputs and for the discharge of system output. The inputs include people, 
materials, information, and finance, which are organized and activated so as to convert human skills and 
materials into products, services and other outputs that are discharge into the environment. The most 
important element of an open system is therefore, their inter-dependence on the community, which many 
are relatively stable or relatively uncertain at a particular point, (Cole 1995:71). 

According to Clelland (1968:15), the management task of integrating various elements in the system is of 
paramount importance and this can only be effectively accomplished if the manager adopts the General 
Systems approach to the system he is managing. The system concept to prepayment metering in utilities 
is, therefore, a simple recognition that a prepayment metering is a system made up of sub- systems or the 
Master station, meter and the vending machine, each of which has its goals to achieve. In other words the 
meter record how much electricity has been consumed by the customer from the total produced by the 
utility while the master station ensures the customer has the correct amount of electricity sold to him 
through the vending machine and administers the whole system. "The boundaries between these sub- 
systems called interfaces may be external or internal." (Cole 1995:72) Consequently, in a system, some 
sub-systems have to deal with the inputs and the output for the system to work consistently at the external 
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boundary e.g. meter reading, billing customers and distribution of bills and so forth. On the other hand, 
other sub-systems deal with consistent provision of services to others in the system at the internal 
boundaries e.g. human resources, management and accounts. 

Prepayment Metering System 

Prepayment Metering is a well established technology being introduced by more and more utility 
companies. According to Keltless (2004), "a Prepayment Metering System is a system where a customer 
pays for energy before using it." A Prepayment Metering System according to Kettless (2004:105), 
basically comprises a system master station (which is a computer that operates and administers the whole 
system), a vending machine (where customers buy their electricity) and prepayment energy meters (or 
dispensers, which dispenses the electricity to the customer). This meter has an interface to the customer 
for managing the transfer of credit and to display the meter and credit status. In this study, the benefits 
and problems of the Prepayment Metering System can only be assessed by looking at various subsystems 
as a whole and seeking to understand and measure the effectiveness of the system. Prepayment metering 
systems are basically categorized as either one way or two-way, referring to the flow of information 
between the vending machine and the meter. 

In the one-way system the information flows only in one direction, from the vending machine to the 
meter. This system can either be addressable or non-addressable. The addressable system uses tokens 
that are personalized to one meter and therefore cannot be used to credit any other meter. In the two-way 
system, information flows in both directions. In this system the meter also returns to the vending 
machine, information such as peak demand, average daily consumption etc. The system inherently 
requires expensive microprocessor smart cards, a sophisticated system of networked computers and 
vending stations. 

Historical Development of Metering 

Up to the 1870s, electricity had little use beyond the telephone and telegraph. The earliest 
use of electricity for power was to operate stings of arc lamps connected in series. "Since the current was 
constant and the voltage required for each lamp was known, and all the lamps were controlled by one 
switch, it was adequate to measure only the time current flowed in the current (lamp-hours)" (Lamphier, 
1925). However, after the invention of the incandescent lamp by Edison in 1879 and the subdivision of 
lighting circuits for individual control of lamps, it was no longer practical to measure lamp- hours, but 
this practice continued into the 1890s. In 1882 Edison started up his first electric company for 
incandescent illumination. Initially he started out with a per-lamp rate. This was unsatisfactory so he 
developed a chemical ampere-hour meter that consisted of ajar holding two Zinc plates connected across 
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a shunt in the customer's circuit. Each month the electrodes were weighed and the customer's bill was 
determined from the change in their weight. However, this meter was inefficient and error prone because 
it was difficult to attribute all the change in weight to the flow of current. Edison did also develop a 
motor- type meter but preferred the chemical meter because of his interest in chemistry. Once the 
transformer was commercially feasible, it helped make the present system of AC transmission and 
distribution possible since it had none of DCs drawbacks at the time (voltage drop in long lines and lack 
of an easy way to increase or decrease voltage). "There was one major obstacle, however: There was no 
meter to accurately record the usage of electricity on AC circuits." (Lamphier 1925) 

In 1885 Galileo Ferraris of Turin, Italy made a key discovery that two of phase AC fields can make a 
solid armature rotate. This discovery spurred developments of induction-type motors as well as paving 
the way for the development of the induction type watt-hour meters. 

In 1886 Professor Forbes of London, England came up with the first meter for use on AC 
circuit that used a heating element connected into the circuit, which operated a small 
windmill connected to a register. Unfortunately this meter was far too delicate for commercial use. 

In April 1888 at Westinghouse, Shallenberger and an assistant were working on an AC arc lamp when a 
spring fell out and came to rest on a ledge inside the lamp. The assistant reached over and was going to 
put it back when Shallenberger noticed the spring had rotated. After he realized that the spring had 
rotated due to rotating electric fields in the lamp he seized the opportunity and designed an AC ampere- 
hour meter in just 3 weeks, and it went on the market 3 months later. Over 120,000 Shallenberger 
ampere-hour meters were sold over the next 10 years (Hammond 1941). Thomson introduced his 
recording wattmeter in 1889. This was the first true wan-hour meter, and it was an immediate commercial 
success, many utilities adopting it as their "standard" model. Although this meter was initially designed 
for use on AC circuits, it worked equally well with the DC circuits in use at the time. The introduction 
and rapid acceptance of induction-type watt-hour meters in the late 1890's relegated the use of this 
commutator-type meter to DC circuits. With the rapid growth of the electric industry by 1894, AC was 
now being used to run motors, and the existing ampere- hour meters and commutator-type watt-hour 
meter were unable to take into account varying voltages and low power factors on AC circuits. Several 
inventors worked to develop a new meter to meet this need, but Shallenberger hit on the most workable 
approach- a small induction motor with the voltage and current coils 90 degrees out of phase with each. 
"This concept was refined into the first commercially produced induction watt-hour. This model was one 
of the heaviest ever offered at 41 pounds and one of the most expensive of its time." (Hammond 1941). 
However, he did not construct instruments that would measure the full continuum between the two 
circuits of AC and DC. 

45 http://sites.google.com/site/ijcsis/ 

ISSN 1947-5500 



Credit Metering 

Credit Metering is the traditional way of metering. This involves installing a meter at the customer's 
premises. The meter records the number of units, usually in kilowatt- hour, consumed every time power 
is switched on. The meter is read usually once in a month to determine the number of units that have been 
consumed by a customer and based on that consumption the customer is billed. The customer only pays 
when the bill has been made and presented to him. This system gives the customer a leeway in terms of 
time in which to settle the bill. 

Prepayment metering System in the United Kingdom 

Technology such as electronic metering systems with more advanced features, new prepayment metering 
technologies, and sophisticated utility software packages provides utilities with reliable, cost-effective 
meter-to-operations centre infrastructure. The challenge from a utility management perspective is how to 
capitalize on the opportunities to increase revenue, decrease costs and improve customer satisfaction. It is 
about extending the utility business well beyond microprocessor-based electronic hardware and enabling 
the utility to introduce systems that realize significant cost savings, efficiency gains and revenue 
enhancements. Revenue collection is one of the core activities of any utility. This has traditionally been 
accomplished using conventional credit meters with regular meter reading, extension of credit to 
customers and normal credit collection mechanisms. But as utilities around the world are coming to grips 
with deregulation, there is an increasing need for them to review existing business processes so as to 
make them more cost effective and customer friendly (Leitner, 1998). 

Kettless ( 2004 ) states that Prepayment metering has been in use in the United Kingdom for well over 70 
years, and with over 3.9 million electricity consumers on prepayment metering alone. The UK is seen as 
the world focus for prepayment development. This is further borne out by the types of token-based 
prepayment systems that have been introduced and developed over the last few years for the UK, which 
are now being marketed worldwide. The system ranges from the use of magnetic cards, key-based 
facilities, smart cards and so on. "It is interesting, however, to look back at the roots of prepayment 
systems, both from engineering and a social point of view. Mention prepayment metering to most utility 
staff and it will immediately conjure up images of consumers with bad debts" (Kettless, 2004:104). The 
UK has a long history of the use of coin operated meters, which allow a customer to pay for his 
electricity as he consumes it. By the late 1970s there were growing problems with coin operated meters. 
Some of them were: They were unreliable, the average 'life' on circuit being typically-5 years before 
needing attention. Although the customer paid in advance, the cash stayed in his meter until collected; 
hence it was not true prepayment from the utility view point. 
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Cash was stolen from meters during burglaries, leaving the customer responsible for replacing it (and 
some customers 'stole' from their own meter). A meter might typically contain in excess of 100 pounds 
between collections. Staff collecting cash were targeted by criminals in robberies, to the extent that some 
collections had to be carried out using armored vans. 

The electricity industry therefore encouraged research into alternative methods whereby 
prepayment facilities could be given without involving cash at the customer premises. Papers from that 
time record trials of, for instance, plastic coins which were crushed after insertion into the meter, and 
magnetic card tickets being developed by London Transport for the Underground system. Developments 
accelerated in the mid 1980s when the Prime Minister- Margaret Thatcher, launched an anti crime 
initiative, which included the problems of theft related to coin operated meters. By then manufacturers 
were close! To offering commercially viable new meters, but it was still early days and the industry was 
cautious about making a commitment to replacing the over one million operated meters. 

One reason for caution was the need for (and cost of) a network of vending machines to support the meter 
base. "Roll-out of new prepayment meters was helped by two major reports prepared by the Electricity 
Industry (EI) around 1988 - one covering card and the other key technologies. Both concluded, taking 
into account benefits from cash flow brought forward." (Price 2001 :62) 

Austin (2002) states that the prepayment industry is in a process of continual evolution as 
newer technologies emerge which provide enhanced functionality and allow utilities to offer innovative 
value-added services. Great strides continue to be made in the measurement techniques of prepayment 
meters, providing utilities with richer engineering services such as power quality monitoring, 
consumption history, time of use and meter status information. The key to gaining the greatest return and 
efficiencies from this information is to implement two-way feedback between the meter and the 
management system. This can be done as part of the purchase cycle, whereby data is transferred from an 
intelligent token device such as a smart card, button or electronic token key. When," new token is 
purchased, the vending outlet downloads the feedback data. However, this method relies entirely on the 
customer's purchasing cycle, which may be month to month, and thus provides no immediacy for 
utilities. According to Austin (2002), the demand for more sophisticated meters creates a ripple effect, 
whereby the revenue management system - the nerve centre for successful prepayment solutions, as it 
records all consumer transactions -undergoes a process of continual improvement to meet the metering 
needs. Reports currently generated from the data-rich management system allow utilities to monitor items 
such as consumer purchase patterns, determine energy load requirements, reconcile to bulk metering 
devices, detect non-technical losses and plan the maintenance of their systems. Simpson (1996) states 
that, advances in metering and communications have meant that many utilities throughout the world are 
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turning to two way communications technology to provide better and more efficient services for their 
customers. A progressive solution using power line carrier communications has been tried by the Scottish 
Power using the Siemens Mains Master 2000 system. Scottish Power believe this trial to be a world first 
in providing an operating token less prepayment system using power line carrier technology without any 
other metering back up. The fact that there are now 3.9 million prepayment customers, as opposed to no 
more that 1.5 million coin meter customer at peak, partly indicates that the new systems provides better 
benefit to both customers and the utility and partly relates to their use for other reasons, supporting the 
reasoning that coin operated meters are an alternative to disconnection of those unable to pay their bills 
despite its own drawbacks. For example, credit customers found to have interfered with their meters so as 
to avoid payment may be required to have a prepayment meter fitted as a condition of continued supply. 
In addition the new meters are flexible as regards tariff and some 25% of meters installed support a two- 
rate heating tariff. "Two major studies - one initiated by the Regular himself - have concluded that there 
is high customer satisfaction (around 90% ) with the systems and the budgeting flexibility they offer." 
(Dick 2003:153) The use of prepayment meters also simplifies accounts considerably, more particularly 
in those cases where the tenants are constantly changing; for possibly two or three accounts might have to 
be sent out per quarter, to say nothing about the difficulty on occasions of finding the 'leaving' consumer. 
The scope of the prepayment meter is, however, not confined to the poor man's dwelling. Its use in flats 
is gradually becoming more extended, and in furnished apartments, where the consumers are chiefly 
nomadic. It relieves the proprietor of all responsibility as regards the consumption, over which he has 
practically no control" (Ferns, 1938:64). 

Electric energy meters, the direct billing interface between utilities and consumers for long, have 
undergone several advancements in the last decade. The conventional electromechanical meters are being 
replaced by new electronic meters to improve accuracy in meter reading. Still, the Nigeria power sector 
faces a serious problem of lean revenue collection for the actual electric energy supplied owing to energy 
thefts. One of the prime reasons is the traditional billing system which is inaccurate many times, slow, 
costly, and lack in flexibility as well as reliability (Devidase? al, 2010). Therefore, attempts are being 
made to automate the billing systems. Even though more accurate and faster meter readings have seen the 
light of day, bill payment is still based on an old procedure. They require an individual/agent to 
personally come down to customer place and note the meter readings and report the amount one has to 
pay to the household/office. But the demand for computing power at all levels of electronic systems is 
driving advancements in semiconductor chip technology. The AMR and power quality monitoring 
systems manufacturers are taking advantage of these advances and integrating them into new meters and 
instruments. The networking technologies are driven by the demand for interconnection of computer 
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user's worldwide (Chandler, 2005). The AMR and power monitoring systems are using these advances to 
expand the monitoring systems. 

In the recent past, several techniques were proposed for detecting the location of direct tapping on a 
feeder or tampered energy meter and identifying illegal consumers. On a parallel track, some non- 
technical measures, such as inspection of customers with suspicious load profiles and campaigning 
against illegal consumption, were also implemented to control electricity theft. Some of the techniques 
(proposed worldwide) are described in this section. A good strategy for fighting corruption in utilities has 
to be developed, considering the political scenarios, business processes, management techniques, and 
technologies in metering and distribution monitoring, control and automation based on the geographic 
location. In addition to the non-technical measures presented earlier, regularization of agricultural 
connections needs to be done. All of the contracts for deployment and maintenance of the distribution 
sector must be outsourced based on the performance of the enterprise to which the bid will be awarded. 
In addition, in most countries, electricity theft is considered a serious offense and illegal usage in any 
form of unbilled energy belonging to a utility is punishable under law. Laws and policies are being 
enforced such that political leaders do not protect corrupt employees and illegal consumers responsible 
for theft. A constituency has been proposed to be created through effective communication with the 
important stakeholders, institutionalization of new business processes that adopt modern technology, and 
improvisation of management information systems. Periodic inspection of illegal connections involves a 
lot of labor and strain for vigilant officials. The shunts detecting equipment proposed are time efficient 
and help in the detection of electricity theft in underground distribution cables. Revenue Assurance and 
Audit Process (RAAP) is composed of macro-functions to detect and analyze revenues involved in illegal 
consumption of electricity. Also, Mano R. et al. suggests proper design and implementation of rules in 
the investigation of illegal consumers. RAAP is targeted at improving the revenues for the utility by 
reducing commercial losses at about 20% each year. In India, the Electricity Act of 2003 has made 
electricity theft a punishable offence and gave full freedom to vigilance officials to inspect and detect 
illegal consumers. In Pakistan, Karachi Electric Supply Company (KESC) has obtained a fatwa or decree, 
from Islamic scholars, declaring that illegal consumption of electricity is a sin. On the other hand, teams 
are arranged for inspection and detection of illegal consumers of electricity, and their reward depends on 
the number of cases they inspect. Such incentives are proportional to the total number of illegal 
consumption cases they detect. Several technical measures were also implemented in order to detect and 
help utilities in their battle against NTL. 

Installation of a prepaid energy meter can be a solution to monitor the distribution system and control 
electricity theft. Location of electricity theft on a distribution feeder can be detected based on the values 



49 



http://sites.google.com/site/ijcsis/ 
ISSN 1947-5500 



of the phase angle and impedance of the transmission lines at two different operating frequencies 
respectively. Bandim C.J. et al. proposed utilization of a central observer meter at secondary terminals of 
distribution transformer. The value of energy read by the central observer meter is compared with the 
sum of energy consumption values read by all energy meters in range. These two values of the current are 
compared to estimate the total electricity that is being consumed illegally. Vigilant Energy Metering 
System (VEMS) is a proposed energy metering system that can fight electricity theft. It has the ability to 
collect, transfer and process data between other energy meters, local station and base station. It also 
identifies probable locations of theft and helps the utilities to control theft. A remote billing system can 
also be developed modifying this model. Illegal consumption of electricity can be detected by using a 
remote check meter based on the amount of losses and time stamp of the check meter. This method is 
implemented before inspecting the illegal consumers personally by the vigilance officials, based on the 
data at the proper frequency of the consumer measurements. A microcontroller based energy meter 
proposed by Jamil M. et al, gives utilities the ability to monitor and control the power supply of its 
spatially distributed consumers. This meter acts as a check meter that helps detect meters that have been 
tampered. In addition, e-metering systems can collect and process data, as well as detect abnormalities in 
load profiles indicating electricity theft. 

Nagi J. et al. proposed a novel approach of using Genetic Algorithm-Support Vector Machines (GA- 
SVM) for detecting illegal consumption of electricity. Load consumption data of all the households is 
collected, and data mining techniques are used to filter and group these customers before detecting illegal 
consumption. Customers are grouped into different classes based on the extent of the abnormality in load 
profile and customers with high probability of theft are personally inspected. The Extreme Learning 
Machine (ELM) approach is used to evaluate abnormal load behavior indicating electricity theft based on 
a load-profile evaluation. Nizar A.H. and Dong Z.Y. used online sequential-ELM (OS-ELM) algorithms 
in detecting and grouping the load profiles to reduce. 

A Prepaid Energy Meter enables power utilities to collect electricity bills from the consumers prior to its 
consumption. The prepaid meter is not only limited to Automated Meter Reading [AMR] but is also 
attributed with prepaid recharging ability and information exchange with the utilities pertaining to 
customer's consumption details. The idea of prepaid metering will be very important for the new research 
fields of Micro-grid and Smart Grid and is an inevitable step in making any grid smarter than it is now. 
Literature has witnessed quite an amount of work in this area. The use of electronic token prepayment 
metering has been widely used in UK for customers with poor record of payment (Southgate et al, 1996). 
A paper suggests a design of a system which can be used for data transmission between the personal 
computer and smart card. The device will transmit the data in half duplex mode (Kwan et al, 2002). The 



50 



http://sites.google.com/site/ijcsis/ 
ISSN 1947-5500 



system designed in this project can be used to develop more complex system where a smart card can be 
used to several applications including prepayment. Another paper features a 3 -tier smart card secure 
solution for a novel prepaid electricity system. It uses an IP-based controller in addition to a power meter, 
providing efficient online control of the amount of electricity consumed by the user (Raade? al, 2007). 
Prepaid meters can also make use of state of art technologies like WiMAX owing to the idea of 
centralized accounting, monitoring and charging. It brings telecommunication to the core of its activities 
to support more Smart Grid applications such as Demand Response and Plug-in electric vehicles (Khan et 
al, 2007). Prepayment polyphone electricity metering systems have also been developed consisting of 
local prepayment and a card reader based energy meter (Ling et al, 2010). In this paper, we have 
attempted to initiate a different idea of using mobile communication to remotely recharge as well as bill 
the consumer's energy consumption. A prepaid card capable of communicating with power utility using 
mobile communication is attached to the energy meter. 

A discussion of smart metering is often accompanied by a good deal of confusion about purpose and 
functionality, so it is necessary to start with definitions. To begin with a basic definition, 'smart' meters 
are primarily 'non-dumb', i.e. they communicate electronically, as: advanced meters that identify 
consumption in more detail than conventional meters and communicate via a network back to the utility 
for monitoring and billing purposes. (Climate Group, 2008, p. 85) It is not always necessary to replace a 
meter in order to achieve smartness: a 'dumb' meter can be 'smarted' by retrofitting it with 
communications capability and this is a less expensive option, for comparable specification. 
(Dimitropoulos (2007) gives a useful appraisal of costs and benefits of the equipment and rollout options 
open to UK utilities.) Taking the definition a little further, the literature shows general agreement that a 
fully smart meter is one that can measure and store data at specified intervals and act as a node for two- 
way communications between supplier and consumer and automated meter management (AMM). This 
allows for a radical change in customer-utility relations, with the possibility of remote disconnection and 
reconnection, remote change of tariff, and remote change in 'contractual power' (the peak electrical 
demand allowed for an individual customer, a familiar concept in Italy and France, for example). Simpler 
versions of communicating meters, usually referred to as 'advanced' rather than 'smart', have one-way 
communications only, from customer to utility. These are referred to as automated meter reading (AMR) 
meters, and have been used by industrial and commercial customers for many years, typically measuring 
consumption at half-hourly intervals for electricity, hourly for gas (Owen and Ward, 2006). They ensure 
accurate billing, make supplier switching more straightforward, and detect fraud more easily than 
standard credit meters. The term 'advanced metering infrastructure' (AMI) refers to the system of meters 
and associated communications. AMI offers accurate, fraud-resistant measurement (e.g. it tells the 
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supplier when usage is suspiciously high or low, or when there is evidence of tampering), and improved 
information flows to enable demand response - the management of demand in relation to prevailing 
supply conditions (Batlle and Rodilla, 2008). It allows for communication hubs that can be used for the 
remote control of electrical appliances, in order to optimize network operation and the use of intermittent 
renewable supply. And it offers the prospect of integrated metering and recording for consumption and 
on-site generation. The most ambitious form of AMI, the 'smart grid', is planned to carry out load control 
at high resolution (the remote control of individual appliances from second to second), in order to cope 
with fluctuations in supply as well as demand. This is expected to become increasingly necessary as more 
intermittent renewable generation comes on stream (for an overview of the scope of smart grids, see 
European Commission, 2006a). Smart grids are still in the early pilot stage. A further definitional twist, 
but an essential one for this paper, comes from the separate development of electronic consumption 
displays, or in-home displays. These are widely (but misleadingly) known as 'smart meters'. Most 
display electricity usage, with a few also showing gas and water consumption. While many models are 
designed to operate with conventional meters, by sending signals to a display panel from a transponder 
attached to the meter tail, some recent models can operate with smart meters, showing accurate data that 
coincide with billing information. 

TYPES OF PREPAYMENT METERS 

Electromechanical prepayment meters in general consist of two parts. These are the integrating (Kwh) 
meter which is usually a standard meter made by the manufacturer to operate on the type of circuit 
concerned and the prepayment mechanism. This mechanism is subject to considerable variation. 



Fixed charge collector - Hand-Reset Type 

This is the simplest form of prepayment metering, consisting of the meter and a switch. At each visit the 
Meter Reader removes the money and trips the meter's switch. The consumer then re-closes the switch by 
inserting the requisite amount of cash. 

Fixed charge collector - Time Switch Type 

In this type of meter the tripping of the switch is performed by an electrically driven clock mechanism. 
The advantage with this meter is that it allows more frequent operations than the hand-operated type, and 
thus needs less amount of money each time to close the switch. 
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Flat rate tariff meter 



In one form or another, this type of meter is the most commonly used? Turning the coin knob in this type 
of meter after the insertion of a coin in the slot advances a mechanical credit register the appropriate 
amount, and also closes the switch if it is not already closed. The coin is also registered on a counter 
which indicates the total number of coins inserted since the meter's installation. As the meter registers 
energy consumption, a linked gearing arrangement causes the credit register to progress towards zero. 
When zero is reached a tripping device operates and causes the switch to break the supply. A differential 
device prevents interference between the coin mechanism and the metering register. The coin register 
drives the credit counter upwards when coins are inserted, but with no effect on the meter register, while 
the meter register drives the credit counter downwards with no effect on the coin mechanism. The force 
required to open and close the switch is provided by a strong spring, which is charged by the consumer 
turning the coin knob and discharged by a trip mechanism on the credit register when it reaches zero. 
Meter manufacturers adopt one of two ways of allowing for unit price changes. In one design the gearing 
between the coin knob and the credit counter could be altered; in the other it is the gearing between the 
meter register and tile credit counter. Either way allows the number of units per coin to be adjusted to suit 
the tariff. 

Two-part tariff - Fixed Rate Type 

The next major development in prepayment metering is the Two Tariff-Fixed Rate type. This meter also 
incorporates a continuously running, constant speed motor. The motor is attached to the credit register 
via a differential gearing arrangement with a meter register. The mechanism used to calculate credit is the 
sum of the speeds of both the motor and the meter register through the differential gearing. The motor 
itself comprises the fixed charge collector. Running continuously through the differential gearing, it 
reduces the available credit in the meter even when energy is not being consumed. Because the motor 
runs at a constant speed, credit in the meter is reduced at a fixed rate per hour in addition to the number 
of units consumed, with a range of gears allowing for different fixed prices to be set. 

Two-part tariff - Variable Rate Type 

This meter has replaced the fixed charge motor with a second gearing system connected to the coin 
mechanism. Insertion of coins diverts a proportion of the money to the credit register and the rest to a 
fixed charge register, reducing the pre-set sum of money owed. When the fixed charge register reaches 
zero the associated gearing is disengaged from the coin mechanism, so that all future coins inserted are 
used only for consumption. The disadvantage with this type of meter is that it makes electricity seem 
very expensive during the period that the fixed charge is being repaid. 
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Double tariff, Current Change-Over Type 



Here the load current passes through a relay in addition to the load switch and current coil. The relay is 
designed to change the gearing between the meter register and the credit register depending upon its 
position. The solenoid remains in the off position when load currents are low (e.g. lighting only) and the 
customer pays a rate that is low per unit. When the load current increases to a certain level the solenoid 
operates causing the gearing between the meter register and the credit register to change consequently 
reducing the consumer's charge per unit. This allows consumers to use heavy loads, such as irons, 
radiators and water heaters, at a reasonable cost, yet enabling the suppliers to obtain a fair price per unit 
when only lighting is being used. 

Double tariff, time Change- Over Type 

Similar in operation to the current changeover meter, this meter employs a clock mechanism to change 
the gearing between the meter register and the credit register at certain fixed times of the day or night. 
Suppliers can thus offer their consumers low rates per unit at night and other off-peak periods. 
Consumers thus benefit from the lower unit prices for their consumption, whilst suppliers benefit from 
improved load factors. Electrolytic Prepayment Meter. 

It consists of a glass cell or jar containing a quantity of copper nitrate solution and a fixed plate of copper 
acting as a cathode. The anode is provided by a strip of copper wound round a bobbin, which is fed a 
short length at a time into the electrolyte by the insertion of coins. The consumer's load (up to 4 amperes) 
passes through the cathode, anode and electrolyte, which results in the anode being dissolved and the 
copper being deposited on the cathode. Eventually, the anode would dissolve sufficiently to break the 
circuit at the surface of the liquid and is only remade by the consumer inserting more coins, causing more 
copper strip to be immersed into the solution. 

This meter suffers from a number of disadvantages. If overloaded, the copper deposited on the cathode 
has a tendency to become uneven and 'trees' often forms which eventually result in the electrolyte being 
short-circuited. The copper strip has to be replaced when used up and every two years or so the cathode 
has to be replaced and the electrolyte filtered and topped up. 
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Albania 

Electricity Regulatory Authority, Mr. Elis Sala, Chief Sector for Licensing/Monitoring 
Armenia 

Public Service Regulatory Commission of Armenia, Commissioner Samvel Arabayjan 
Azerbaijan 

Tariff (Price) Council of the Republic of Azerbaijan, Ms. Nurlana Musaeva, Chief adviser, The 
Department of Monitoring, analysis and methodology, Methodology and information sector 

Bosnia & Herzegovina 

State Electricity Regulatory Commission, Mr. Sasa Scekic, Head of Licensing and Engineering 
Department 

Chaiman of the Licensing/Competition Committee 
Bulgaria 

State Energy and Water Regulatory Commission, Commissioner Plamen Denchev 
Croatia 

Croatian Energy Regulatory Agency, Ms. Loridana Smoljanic, Head of the Legal Affairs and License 
Department 

Estonia 

Energy Regulatory Department of the Estonian Competition Authority, Ms. Riina Randma, Chief 
Specialist, Energy Regulatory Division 

Georgia 

Georgian National Energy Regulatory Commission, Commissioner Irma Kavtaradze 
Hungary 

Hungarian Energy Office, Ms. Zsuzsa Cseko, Analyst 
Kazakhstan 

Agency of the Republic of Kazakhstan for Regulation of Natural Monopolies, Ms. Kalila Tanatovna 
Kokkozova 

Director of the Department on regulation in sphere electro- and power system 
Kyrgyz Republic 

National Agency for Antimonopoly Policy and Development of Competition, Mr. Timur Orozaliev, Head 
of Price and Tariff Department 
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Latvia 

Public Utilities Commission, Ms. Digna Eglite, Head of the Energy Regulation Division in the Legal 
Department 

Lithuania 

National Control Commission for Prices and Energy in Lithuania 
Macedonia 

Energy Regulatory Commission of the Republic of Macedonia, Mr. Strasho Zafirovski, Head of 
Technical/Energy Department 

Moldova 

National Energy Regulatory Agency, Mr. Gheorghe Lupan, Head of Regulation and Licensing 
Department 

Mongolia 

Energy Regulatory Authority of Mongolia, Mr. Monkhtulga Monkhoo, Head of the Licensing 
Department 

Montenegro 

Energy Regulatory Agency of the Republic of Montenegro, Mr. Miodrag Djekic, Presiding of the board 
Poland 

Energy Regulatory Authority, Ms. Malgorzata Wesolowska, Head of Unit at the Department of the 
Energy Enterprises 

Romania 

Romanian Energy Regulatory Authority, Commissioner Maria Manicuta, Director, Licensing and 
Network Access Department 

Russian Federation 

Federal Tariff Service, Mr. Igor Kasaev, Deputy Head of Legal Department 
Serbia 

Energy Agency of the Republic of Serbia, Mr. Nikola Radovanovic Senior Expert for System and Legal 
Affairs 

Slovakia 

Regulatory Office for Network Industries, Ms. Michaela Urickova, Department of Strategic Analysis 
Turkey 

Energy Market Regulatory Authority, Mr. Hulusi Kara, Group Head, Electricity Market Department 
Ukraine 
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National Electricity Regulatory Commission of Ukraine, Mr. Yuriy Antonyuk, Head of Licensing 
Department 

Bosnia and Herzegovina 

Regulatory Commission for Electricity in Federation of Bosnia and Herzegovina, Mr. Ahmet Hukic, 
Head of Licensing and Technical Dept 

Bosnia and Herzegovina 

Regulatory Commission for Energy of Republika Srpska, Mr. Dragutin Petkovic, Head of Licensing 
Department 

Romania 

National Regulatory Authority for Municipal Services, Ms. Anca Cador Expert, Market Monitoring and 
Territory Consultancy Department 

UNMIK Kosovo 

Energy Regulatory Office, Mr. Afrim Ajvazi, Legal and Licensing Officer 
The Hashemite Kingdom of Jordan 

Electricity Regulatory Commission, Ms. Muna AlMusa, Licensing and Monitoring Engineer 



Nigeria 

Nigerian Electricity Regulatory Commission, Mrs. Olufunke Dinneh, Head, Legal, Licensing & 
Enforcement 

Saudi Arabia 

Electricity & Co-Generation Regulatory Authority, Mr. Abdulrahman M. Al-Mohizai, Director General, 
Licensing and Legal Affairs 

United Arab Emirates 

Regulation and Supervision Bureau, Mr. Andrew Walker, Director of Economic Regulation 
United States of America 

National Association of Regulatory Utility Commissioners, Ms. Kim Wissman, Deputy Director, Public 
Utilities Commission of Ohio 



MOTIVATION 
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Losses that occur during generation can be technically defined, but Transmission and Distribution 
(T&D) losses cannot be quantified completely from the sending-end information. Distribution losses in 
several countries have been reported to be over 30%. Substantial quantity of losses proves the 
involvement of Non-Technical Losses (NTL) in distribution. Total losses during T&D can be evaluated 
from the information like total load and the total energy billed, using established standards and formulae. 
In general, NTL are caused by the factors external to the power system. Electricity theft constitutes a 
major chunk of the NTL. Major forms of electricity theft include bypassing (illegal tapping of electricity 
from the feeder), tampering the energy meter, and physical methods to evade payment. Electricity theft 
can be defined as, using electricity from the utility without a contract or valid obligation to alter its 
measurement. Worldwide T&D losses are more than the total installed generation capacity of countries 
such as Germany, the UK, or France. It is estimated that utilities (worldwide) lose more than $25 Billion 
every year due to illegal consumption of electricity. For example, utilities in Nigeria lose around $4.5 
billion every year due to electricity theft and a recovery of about 10% NTL can conserve about 83,000 
GWh of electric power annually. In Pennsylvania, PPL, a utility reports an increase in electricity theft by 
16% compared to 2013. It has also been identified that the illegal consumption of electricity by local 
business sector is increasing. Electricity worth approximately $14 million was pilfered in 2013 in the 
Houston area. In one year, Tampa Electric Company has seen a 20% rise in electricity theft, whereas, 
Progress Energy has seen an increase between 15 and 20%. Cost of nationwide electricity theft in USA is 
about $1-6 Billion every year. In Canada, BC Hydro reports that the electricity theft costs $100 million 
every year. Figure below shows overall T&D losses in several countries. It is evident that Billions of 
kWh of energy is being pilfered every year in several countries. Total losses incurred by utilities due to 
electricity theft are huge. As the impact of these losses is huge, it is essential to force the implementation 
of a mechanism that reduces NTL. Quality of the power generated, transmitted, and distributed, 
influences the 3 power system components, as well as customer appliances. Illegal consumption of 
electricity makes the estimation of overall load in real time very difficult. However, parameters involved 
in analyzing electricity theft include political, economic, criminal, and managerial. In addition, priorities 
in investment on implementation of new measures might also be prone to corruption. 
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Countries 



Nigeria, Mexico, Pakistan, , Colombia, Brazil 

Fig: 5 : Levels of bypass 



Type of Load 


Power 


Quantity 


Demand 


Actual 




Rating(W) 




Factor 


Power(W) 


Medium size Deep Freezer 


130 


1 


0.5 


65 


Washing Machine 


280 


1 


0.5 


144 


Microwave Oven 


1000 


1 


0.5 


500 


Electric Pressing Iron 


1000 


1 


0.5 


500 


Air-Conditional 


1170 


1 


1.0 


1170 


Refrigerator 


500 


1 


0.5 


250 


Ceiling fan 


100 


5 


0.7 


350 


Incandescent Bulb 


60 


23 


0.7 


966 


Sony 21" Television 


100 


1 


0.5 


50 


Sharp 14" Television 


80 


1 


0.5 


40 


Sony Music System 


100 


1 


0.5 


50 


DSTV Receiver 


50 


1 


0.5 


25 


DVD Player 


50 


1 


0.5 


25 




Total 


4135 



Load Assessment of the Customer's Three-Bedroom Apartment. 

Table: 1. 



MEASURES AND METHODS OF STEALING ELECTRICITY 

In general, electricity consumers may be generalized as genuine customers, partial illegal consumers, and 
illegal consumers. There are several simple and sophisticated methods used in pilfering electricity, 
discusses factors that influence illegal consumers to steal electricity. The most common and simplest way 
of pilfering electricity is tapping energy directly from an overhead distribution feeder as shown in the 
diagram below. The next most prominent method of electricity theft is the manipulation of energy meters 
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that are used for recording and billing industrial, commercial and household energy consumption. 
Though there are many techniques for tampering with such meters, some of these may include: 

• Exposing meters to strong magnetic fields to wipe out the memory. 

• Inserting a film or depositing high viscous fluid to disturb the rotation of disc. 

• Implementation of sophisticated technologies like remote sensing devices. 

• Tampering the crystal frequency of integrated circuits. 

• Creating a link between the breaking control wires in an energy meter would divert the current 
reading in the meter reflecting zero reading at all times. 

• In the case of electronic meters, Radio Frequency (RF) devices are mounted to affect the accuracy 
of the meter. 

• A shunt is installed between the incoming and outgoing meter terminals. 

• Inter-changing the incoming and outgoing terminals of the meter. 

• Damaging the pressure coil of the meter. 

• Resetting the meter reading. 

• Introducing unwanted harmonics. 

• Exposing the meter to mechanical shock. 

• Voltage is regulated from the meter terminals, making it read lesser quantity then the original 
consumption. 




Figure 6: Tapping electricity directly from a distribution feeder - bypassing the meter. 
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Other engineered methods of tampering with the meter without damaging its terminals are illustrated 
below. Two-watt hour meters (employed for measuring the energy consumed by large loads with three 
phase electric supply) are tampered according to the following process: Damage the terminal seal; 
connect one of the load terminals to the ground; and open the ground wire from the energy meter. In the 
case of three phase meters, phases are shifted to lower the power consumption reading by the energy 
meter. Another popular way of lowering the energy meter reading without directly tampering with the 
meter is shown in Figure below. Here, supply voltage is regulated to manipulate the meter reading. Illegal 
consumers accomplish this by using one of the three phases; disconnect neutral from the distribution 
feeder, and using a separate neutral for the return path. Therefore, the energy meter assumes that the 
voltage between the connected phase and this new neutral is zero, implying that the total energy 
consumed is zero. Another way of stealing electricity is by isolating neutral and disturbing the electronic 
reference point by physically damaging the meter. The voltage to be read by energy meter can then be 
manipulated by controlling the neutral. In general, illegal consumption of electricity will be predominant 
only at desired hours of the day - when the customer's demand is high i.e. using legal electricity for small 
household loads and illegally tapped electricity for heavy loads. This kind of theft (partial illegal 
consumption) is very difficult to measure, as the energy consumption pattern is uneven over a period of 
time. In addition, corrupt employees are often responsible for billing irregularities; they record an amount 
of consumption that is lower than the original consumption. On the other hand, improper calibration and 
illegal de- calibration (during manufacturing) of energy meters can also cause NTL. In most of the meter 
tampered locations, damaged meter terminals and/or illegal practices may not be visible during 
inspection. 
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Figure 7: Technique used by illegal consumers to regulate the supply voltage and manipulate the energy 
meter reading 

FACTORS THAT INFLUENCE ILLEGAL CONSUMERS 

Factors that influence consumers to steal electricity depend upon various local parameters that fall into 
multiple categories like social, political, economic, literacy, law, managerial, infrastructural, and 
economical. Of these factors, socio-economic factors influence people to a greater extent in stealing 
electricity. More concisely, some of the important factors are: 

• The belief that it is dishonest to steal something from a neighbor but not from a utility (public or 
large entity). 

• Higher energy prices, unemployment or weak economic situation of a consumer. 

• Corrupt employees of the utilities are responsible for billing irregularities. In some cases, total 
money spent on bribing utility employees is less than the money that would have been paid for 
consuming the same amount of electricity legally. 
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• Some consumers might not be literate about the issues, laws and offenses related to the energy 
theft. 

• Weak accountability and enforcement of law. 

• Reasons to hide total energy consumption (e.g. Consumers who grow marijuana illegally or 
small-scale Industries to hide overall production/turnover). 

In essence, electricity theft is proportional to the socio-economic conditions of the consumer. 

DESIGN OF PREPAID ENERGY METER 

The proposed idea is not to replace the existing energy meter and chalk out a completely new prepaid 
meter but up-grade the available energy meters to prepaid meters and Load meter. Thus, our design 
primarily has an energy meter, a prepaid card and the communication module encapsulated and provided 
as an upgrading attachment along with a contactor and a liquid crystal display (LCD). 

Energy Meter: The electromechanical energy meter calculates the electrical energy or units consumed by 
the load based on the mechanical energy of the disk or rotor. The electronic meter has this existing 
structure attached with a microcontroller programmed to perform specific calculations and present it in 
terms of electrical energy units consumed to a prepaid card. The meter is also connected to a contactor 
apart from the consumer load. 

Prepaid Card and Communication Module; The prepaid card is the most important addition to the 
design. The power utility sets the amount in the prepaid card to a measure that the consumer recharges 
the card to, called Fixed Amount. The tariff rates are already programmed and fed into the card. As the 
load is consumed, the meter sends the units consumed to the prepaid card which continuously converts 
these units into expenditure at each instant and then subtracts it from the fixed amount. The 
communication module uses mobile communication to share prepaid card balance with power utility at 
certain instants as required by utility for tracking the balance and also for any other application e.g. 
Demand Side Management (DMS) etc. The fixed amount in the prepaid card will go to zero eventually 
with the consumption. The consumer can recharge the prepaid card by prepayment through internet. The 
utility on receipt of recharge request and desired prepaid amount, recharges the customer's energy meter 
i.e. prepaid card. The prepaid card sends a signal to the contactor for monitoring the supply to the 
consumer. The communication module has prepaid card encapsulated inside the encryption 
authentication module which is Embedded Security Access Module (ESAM). It thus enables the card to 
use the mobile communication to communicate with power utility and share information regarding the 
card's balance details. 
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Contractor: A local contractor is the connecting link between the consumer load and utility supply. The 
opening and closing of this contactor depends on the balance present in the prepaid card at a moment. 
While the prepaid card has some fixed amount more than zero, it stays closed and keeps the utility supply 
uninterrupted to the consumer load. When the card runs out of balance, it opens and disconnects the load 
from the supply. Hence, even when the energy meter receives voltage supply, it does not reach the load 
while the contactor is open because the balance in the prepaid card is not available. Since the contactor 
too will consume some amount of electrical energy, it will be inclusive in the calculations made by meter 
and prepaid card. 

Power line for Load meter. 

AMR is a method where electronic data is transmitted over power lines back to the substation, then 
relayed to a central computer in the utility's main office. This would be considered a type of fixed 
network system the network being the distribution network which the utility has built and maintains to 
deliver electric power. Such systems are primarily used for electric meter reading. Some providers have 
interfaced gas and water meters to feed into a PLC type system. Some technology like the touch 
technology, handheld technology are not common in Nigeria but the only technology to be used is the 
mobile technology, because the meter readers still have to go to the houses, offices and other places 
where the meters are placed. In addition to the mobile technology, we need extra devices which are not 
very expensive. The Power Line technology is not also feasible for Nigeria perspective. In Nigeria high 
voltages transmits through the power line cable. As the voltage is high so the transmitted data will be 
corrupted by the attenuation. All the power line cable of our country is not placed under the ground. It is 
situated in the open air. So the cable faces different environmental problems. So the actual data may not 
transmit to the provider end. As a result this technology is also not feasible in our country. The fixed RF 
technology has small coverage area. As a result, this method consist of a number of series of antennas, 
towers, collectors, repeaters, or other permanently installed infrastructure to collect transmissions of 
meter readings from AMR capable meters. So this is not cost efficient for the customers. The main reason 
is to introduce an AMR system which is cost efficient, to overcome the problem of bypassing, to issue 
that the prepaid meter reading is the same with the Load meter, improve meter reading accuracy, to 
enable faster, more efficient reading times and billing process, Significantly increase operational 
efficiency by providing real time pricing and time-of-use metering. 

EXPERIMENTS AND FINDINGS 
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Almost all the meter reading systems consists of three primary components. We divided the whole AMR 
system into four basic units. These are: Reading unit, Communication unit, Data receiving and processing 
unit, billing system. 
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Fig: 8 Component of Meter Reading. 
Reading unit 

In this part basically two important jobs have been done. At first the analog meter reading was converted 
to digital bits sequence (0 or 1). After that the data are available in the microcontroller for transmission. 
First challenge is how we can get reading automatically from analog digital watt meter. It was analyzed 
that the rotations of the disk of the meter are needed to be counted. If it is possible to measure the number 
of rotation of the disk of the meter then the meter reading can be calculated. There are various types of 
sensors that are available to perform the reading of the meter. Infrared can be used as a sensor. The 
infrared transmitter generates frequency and the receiver receives it. If there is no obstacle between the 
infrared transmitter and the infrared receiver than the infrared receiver give one value. But if there is any 
obstacle then the receiver gives another value. So this event can be used to count the rotation of the disk. 
The infrared transmitter and receiver have to place in such a way so that when the disk of the meter 
rotates it can be recognize without any obstacle. 

COMMUNICATION TECHNOLOGIES 

Utilization of the smart meter system involves a large quantity of data transfer between the utility, smart 
meter, and home appliances in the network. This data is sensitive, confidential, and access to this data 
should be given to only a few personnel. With the restrictions on this data, security guidelines are 
formulated for transmission, collection, storage, and maintenance of the energy consumption data. The 
communication standards and guidelines were formulated to ensure that data transfer within the network 
is secure. It is equally important that this data must represent the complete information regarding the 
customer's energy consumption and status of the grids without any potential manipulations or 
miscalculations. So, this data must be authenticated and should reflect information about the target 
devices correctly. In the figure below, devices in the transmission sector ensure proper transmission of 
generated energy, control systems in the distribution sector ensure monitoring and controlling of faults, 
communication devices like protocol gateways, data collectors, repeaters and network operations 
coordinate data as well as control signals between all the devices in the communication network. 
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The common network selected has to support the required operation of the smart meter system even on 
power outage and support distribution automation. In addition, the selected network and its components 
must be cost effective and must support "traffic prioritization" i.e. they must prioritize the delivery of 
data based on its time and direction sequence. Communication technology to be chosen should be cost 
effective, provide good transmission range, better security features, bandwidth, and power quality with 
least possibility of repetitions 
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Figure 9: Framework of communication and electricity networks in a smart grid environment. 

Bluetooth technology can be a possible option for communication of control signals and transmission of 
energy consumption data. In view of implementing this technique, B.S. Koay et al. proposed a Bluetooth 
based energy meter that can collect and transmit the energy consumption data wirelessly to a central base 
station. Power Line Communication (PLC) and Broadband over Power Line (BPL) communication are 
the other possible options of data transfer supporting the higher level communication suites such as 
TCP/IP. PLC uses the existing electricity grid, cellular/pager network, and mesh network, a combination 
of licensed and unlicensed radios, wireless modems, existing internet connections, power line 
communications, Wi-Fi, WiMAX, and Ethernet with a protocol to upload data using IEC DNP. PLC 
technology effectively automates the process of data collection in smart meter applications. Despite huge 
overhead due to IPv6, IPv6 can be applied to physical layer with lower data rates. However, IPv6 
combined with Media Access Control (MAC) algorithm accomplishes less delay time and higher 
throughput. Though this combination might slightly reduce the usable data transfer rate, it will not affect 
the overhead at the MAC layer. IP based network protocol could be another promising option for 
communication because of its advantages over other technologies while satisfying the security standards 
of the smart grid communications. In addition, TCP/IP forms an efficient communication platform across 
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multiple devices. In addition, Session Initiation Protocol (SIP), a text-based signaling protocol, is 
employed for controlling multimedia sessions like video and Voice over Internet Protocol (VoIP). SIP 
integrates several features of HTTP and Simple Mail Transfer Protocol (SMTP). SIP is an open and 
standards-based technology, which provides a robust communication medium for the smart grid 
applications. SIP can be implemented on top of TCP, User Datagram Protocol (UDP), or Stream Control 
Transmission Protocol (SCTP). A new architecture based on DNP3 is proposed by T. Mander et al. 
DNP3 produces a protocol discontinuity between DNP3 devices (used for regulated power 20 system 
operations) and TCP/IP devices (used for the smart load and demand management). The advantage with 
this architecture is, the discontinuity limits the vulnerable attacks from other TCP/IP devices. Some 
security enhancements such as data object security and a security layer may be added to DNP3, as this 
protocol by itself is not adequately safe for collaborative operations. Data object security appends 
additional rules to access data thereby preventing the unauthorized access that can potentially manipulate 
data and device operations. An energy meter based on Peer-to-Peer (P2P) network is presented in S. 
Rusitschka et al. The utilization of P2P network enhances the range of operations. In addition, several 
value added services can be employed. P2P communication uses the internet, which leads to a cost 
effective design of smart grid communication networks. In addition, the P2P network utilizes the 
resources of participating homes optimally. 

Yet another network, Zigbee, is a potential communication network for transfer of data as well as control 
signals. As many industrial and household entities maintain a computer with 802.11.x, Zigbee protocol 
can be used with Home Area Networks (HANs) for data transfer over 802.11.x. This technology can be 
used instead of increasing the operating clock frequency in the crypto core in order to reduce the response 
time and verification delay; J. Kim et al. proposed the mode toggling approach on the design process for 
AES-CCM module. They have also adopted the optimal security material management module. These 
design methodologies and the obtained response time allow the cryptographic core to maintain the 
minimum clock frequency, while staying within the constraints, ensuring the reduction in total dynamic 
power consumption. 

General Packet Radio Service (GPRS) technology is another potential communication medium for 
transferring both the data and control signals wirelessly over long distances. In contrast to other 
communication network technologies, only a few communication characteristics that represent GPRS 
communication network have been assessed. That being said, lack of tools for detecting a network failure 
would be a major setback in implementing GPRS network in many geographical locations. Before 
deploying a GPRS based communication system in a specific location, availability and quality of the 
signal has to be determined. Parallel processing and implementation of the Field-Programmable Gate 
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Array (FPGA) hardware can reduce the time elapsed for interpreting the data and obtaining the status of 
the distribution network. Adoption of reconfigurable logic for processing of data minimizes the amount 
of data to be generated by a Smart Meter. 

This is one of the most important and challenging part of this system. This part is challenging in the 
sense, that data is the most valuable part for meter reading and billing system. Data should be transmitted 
in an efficient manner without any loss of data. Let's describe about this challenging part. From the 
above description it is clear that digital data is always ready for transmission. Meter reading are stored in 
Microcontroller's EPROM and this data is always ready for transmission. But the main concern was how 
data can be transmitted efficiently? From the background study it is realized that all the existing 
communication units are not feasible for higher cost and infrastructure of Nigeria Power Supply. After 
studying different technologies, Wimax has been chosen for communication. In Wimax possibilities of 
data corrupting is very less and the coverage area is very high in Wimax when compare to other 
communication facilities. For the purpose of communication between meter end and the server end, a 
small miniature and low cost Wimax Transceiver module is required in each meter and in the server end. 
A transceiver module is a module which can transmit and receive data at a time. After searching a 
miniature and low cost transceiver module has been found. 
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Fig. 10. Pin Information of AT86RF525B. 

The model name is AT86RF535B. AT86RF535B is a fully integrated, low cost RF 3.5GHz Low- 
IF/Zero-IF conversion transceiver for WiMAX applications. It combines excellent RF performance, small 
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size, and low current consumption. The AT86RF535 chip is fabricated on the advanced SiGe BiCMOS 
process AT46000. The transceiver combines LNA, PA driver, RX/TX mixer, RX/TX filters, VCO, 
Synthesizer, RX Gain control, and TX Power control, all fully digitally controlled. This transceiver 
module is miniature in size. This Wimax transceiver module can be set up inside of the current analog 
wattmeter. And this transceiver also cost very lower than other transceiver. In the server end there will be 
a transceiver and each meter will contain a miniature transceiver. A computer application will run at the 
server end which can send an address of a particular meter to the microcontroller and the microcontroller 
will supply the address to the transceiver. Then the transceiver will send the address to all meters. 
Generally all the transceiver of the meters will be in sleep mode. When a transceiver of the meter 
receives the address sent by the server transceiver then it compares that is the request is for itself or not. 
If the request is for itself then it give a high signal to the microcontroller and the microcontroller send the 
data to the transceiver. After getting the data from the microcontroller the transceiver transmit the data. 
The server end transceiver receives the data and provides the data to the server end microcontroller. The 
microcontroller sends the data to the computer. This is the overall communication part of our system. Fig. 
6 shows the data flow in communication unit of our proposed system 
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Fig: 1 1 .Data flow in the communication units. 
Data receiving and processing unit 



This is the third part of the proposed AMR system. In this part the received data is processed by the 
system for future purpose. For data processing purpose a computer application has been developed. The 
task of the application was to take a meter number form the user and give the address to the 
microcontroller through serial port. Then the microcontroller does the communication task. After 
communication part the microcontroller get the data form transceiver and the meter reading is available 
in the server end microcontroller. Then the data is sent to the computer and the computer application 
receives the data from the microcontroller. This data can be stored in the database and can be displayed to 
the requested user. 
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Fig: 12 Block diagram of data receiving and processing unit. 



Billing System 



The billing system has been developed in our system which can take the meter number and can generate 
bill for that meter. It uses the data of the database those are collected from the meter reading through all 
the unit of our system. This system also can be used for analysis on electricity usage for each meter. 
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Fig 13 : Example on demand Reading using the computer application. 



Overall Conceptual design 

In this system the existing analog meter will be used and our proposed miniature module will be added to 
each meter. A module will be situated in the server end and this module will be connected with the server 
computer. The entire module will be connected through Wi-Max technology. The overall conceptual 
design has given in fig. 9. 
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Fig: 14. Conceptual Diagram of our proposed AMR. 
WIMAX AS A TRANSMISSION MEDIA 

WiMAX, the Worldwide Interoperability for Microwave Access, is a telecommunications technology 
aimed at providing wireless data over long distances in a variety of ways, from point-to-point links to full 
mobile cellular type access. It is based on the IEEE 802.16 standard, which is also called Wireless MAN. 
The name WiMAX was created by the WiMAX Forum, which was formed in June 2001 to promote 
conformance and interoperability of the standard ( http://en.wikipedia.org/wiki/WiMAX ). WiMAX has a 
theoretical maximum bandwidth of 75Mbps. This bandwidth can be achieved using 64QAM 3/4 
modulation. 64QAM can only be utilized under optimal transmission conditions. WiMAX supports the 
use of a wide range of modulation algorithms to enable the most bandwidth to be realized under all 
conditions (http://en.wikipedia.org/wiki/WiMAX). WiMAX has a theoretical maximum range of 31 
miles with a direct line of sight. Near-line-of-sight (NLOS) conditions will seriously limit the potential 
range. In addition, some of the frequencies utilized by WiMAX are subject to interference from rainfade. 
The unlicensed WiMAX frequencies are subject to RF interference from competing technologies and 
competing WiMAX networks. WiMAX can be used for wireless networking in much the same way as 
the more common WiFi protocol. WiMAX is a second-generation protocol that allows for more efficient 
bandwidth use, interference avoidance, and is intended to allow higher data rates over longer distances. 
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ONLINE FEEDBACK 



Web-based applications for customer feedback are favoured by utilities. They are relatively inexpensive 
(no costs sunk in the manufacture and distribution of dedicated displays), can be updated rapidly, and 
ensure that the supplier has access to, and controls all the information. They can also be used to process 
data for sending to customers via a range of devices, including mobile phones and personal computers. 
This could be a promising application for alerting householders to abnormal consumption. Web 
applications can show householders a great deal of detail over time about their own consumption and 
about the wider picture: how their usage compares with that of others; or what the demand curve for the 
nation or region looks like (at times of supply constraint). The recent move by Google into providing 
energy feedback demonstrates interest from third parties in providing this particular form of energy 
service. However, there are limitations, chief among them the difficulty of getting people engaged deeply 
enough to access the information on a regular basis. It takes extra effort and determination to look up 
consumption data online compared with the effort needed to check a dedicated display in the home. The 
UK government response to the latest consultation on smart metering comments that: The Government's 
position remains that a standalone display should be provided with the smart meter. The provision of a 
display is important to securing the consumer benefits of smart metering, delivering real time information 
to consumers on their energy consumption in a readily accessible form. Whilst, there are alternative 
means of information provision evidence to date suggests they may often be less effective, especially 
where they require positive action by the consumer to access information. Experience in the USA and 
Sweden has, for example, shown that where smart meter data has been available online, the usage rate 
has been low at 2 to 4 per cent of customers. (DECC, 2009b) 

The possibilities for online feedback are evolving, along with support services and customer-relations 
programmes. Some are proving successful in terms of engagement, but again it needs saying that they do 
not necessarily need smart meters for this: people can key in their meter readings over a period of time in 
conjunction with a feedback or advice programme, and there are successful examples of this. 

This overview of feedback arrangements shows something of what has been achieved in terms of 
customer engagement without smart metering. It also points towards what could be achieved through a 
well-designed 'smarting' of metering systems, i.e. a system designed with customer relations and 
customer learning as priorities. Information on its own may or may not be of any practical use to 
consumers; it has to be absorbed and tested in particular buildings, in company with particular people, 
and in particular climatic, regulatory, and political circumstances. Smart metering can greatly improve 
the information available to both supplier and consumer; however, the challenge is to develop 
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communications that can be used to select the most useful information for the consumer and to combine 
them with advice and other pointers to enable effective action. 

METHODOLOGY 

In this section, the overall design of the system is discussed in terms of three-tier architecture. Three tier 
architectures are variations of the well-known client/server computing model. The model was proposed 
as an alternative to centralized mainframe and time sharing computing. In this model, the client interacts 
with the user possibly via a GUI interface, and requests on-line services from the server. The server, on 
the other hand, answers these requests and provides the services. The fact that this system is spread 
across more than two different entities suggests three-tier architecture. Such architecture brings clear 
logical structure to the system. A major advantage of the three-tier architecture is scalability, that is it 
supports hundreds of users and able to manage these connections (via queuing, for example) better due to 
the middle tier. Scalability is a major requirement for our design since there could be thousands of 
customers who are trying to update the balance on their prepaid electricity card and trying to establish a 
session with the servers. 



Client Tier 




Data Seiner Tier 



Figure 15: Three Tier architecture. 

In additions, with the three-tier architecture most of the application (business) logic like locating the 
appropriate database, checking authority, generating query, is moved to the middle tier. Therefore, in the 
case of the three-tier architecture, changes in the business logic result in less client tier changes. Another 
advantage of the three-tier architecture is that its data security is increased because the client tier no 
longer can access the data directly; it has to go through the middle tier first. 
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Figure 16. Block diagram of customer and utility processes 

A customer purchases electricity credit at the nearest electricity POS. The electricity POS device is 
commonly referred to as a credit dispensing unit (CDU). Electricity is purchased as a monetary value and 
encoded as a kilowatt hour (kWh) value in the token. The encrypted credit transfer token is generated by 
the CDU and printed on a receipt or encoded on a magnetic card, depending on the meter type. The 
meter's credit register is only updated once the token has been correctly decrypted and accepted by the 
customer's meter. A prepayment token is requested from a Server that is remote from the actual point of 
sale client application making the request. The token is only generated on the Server and transferred to 
the POS client, once the transaction, the POS client and the payment mechanism has been authenticated 
and authorized. The connection between the POS client and the Server is a standard computer network 
communications channel (dial-up, Internet, frame-relay, and General Packet Radio Service, or WIMAX). 
Diagram is illustrated in fig below. 
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Figure: 17. The online vending context. 

The POS clients must communicate with the online vending server to complete a prepaid vending 
transaction. Without such a communications link no prepaid transactions are possible. The prepaid card is 
the most important addition to the design. The power utility sets the amount in the prepaid card to a 
measure that the consumer recharges the card to, called Fixed Amount. The tariff rates are already 
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programmed and fed into the card. As the load is consumed, the meter sends the units consumed to the 
prepaid card which continuously converts these units into expenditure at each instant and then subtracts it 
from the fixed amount. The communication module uses mobile communication to share prepaid card 
balance with power utility at certain instants as required by utility for tracking the balance and also for 
any other application e.g. Demand Side Management (DMS) etc. The fixed amount in the prepaid card 
will go to zero eventually with the consumption. The consumer can recharge the prepaid card by 
prepayment through internet. The utility on receipt of recharge request and desired prepaid amount, 
recharges the customer's energy meter i.e. prepaid card. The prepaid card sends a signal to the contactor 
for monitoring the supply to the consumer. The communication module has prepaid card encapsulated 
inside the encryption authentication module which is Embedded Security Access Module (ESAM). It 
thus enables the card to use the mobile communication to communicate with power utility and share 
information regarding the card's balance details. 




Figure 18: Main components of the smart card system 
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CONCLUSSION 



Today, a high percentage of electricity revenue is lost to power theft, incorrect meter reading and billing, 
and reluctance of consumers towards paying electricity bills on time based on postpaid meter. 
Considerable amount of revenue losses can be reduced by using Prepaid Energy Meters. A prepaid 
energy meter enables power utilities to collect energy bills from the consumers prior to the usage of 
power by delivering only as much as what has been paid for. This research provides a prepaid energy 
meter behaving like a prepaid mobile phone. The meter contains a prepaid card similar to mobile SIM 
card. The prepaid card communicates with the power utility using mobile communication infrastructure, 
once the prepaid card is out of balance, the consumer load is disconnected from the utility supply by the 
contactor. The IP-based controller for the prepaid meter and Load meter is to provide a simple way of 
detecting electricity power theft without any human intervention. The Load meter would indicate exact 
building or location and distribution line on which unauthorized taping is done in real time. It would be 
time saving if distribution company personnel take reading by this wireless technique and also it would 
provide a digital record in case of any judicial dispute which will be use for comparative analysis 
between the prepaid meter. The idea is to maximize the profit margin of power utility company, efficient 
online control of the total amount of electricity consumed in a specific location and be able to detect 
when there is bypass by the user either by shoot- hunting without connecting the cable through the digital 
meter or parts of the equipment are connected through to the smart meter why high voltage equipment are 
bypassed. 
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Expert approaches, Innovation Technology and Management : Innovation and product development, 
Emerging advances in business and its applications, Creativity in Internet management and retailing, B2B 
and B2C management, Electronic transceiver device for Retail Marketing Industries, Facilities planning 
and management, Innovative pervasive computing applications, Programming paradigms for pervasive 
systems, Software evolution and maintenance in pervasive systems, Middleware services and agent 
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